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INTRODUCTION 



Testing is a part of the student learning process. The development of a good test 
containing true-fdse, multiple-choice, matching, and essay items is a difficult and 
time-consuming task. Many classroom teachers have received inadequate instruction 
in planning and developing tests that accurately measure student achievement. 

How to Write Tests for Students offers guidelines, practical suggestions, and 
examples for constructing better tests. It is divided into six chapters: Planning 
Tests, Developing Tests, Assembling and Administering Tests, Interpreting Test 
Results, Assessing Test Items, and Impioving Test Validity and Reliability. Each 
chapter concludes with review questions and activities, which apply the information 
presented. In addition, the publication contains four apnendixes: a description of 
hierarchical levels for the affective, cognitive, and psychomotor domains; sample 
verbs for writing instructional objectives; a listing of selected computer software re- 
sources for test generation and analysis; and sample test items from various curricu- 
lum areas. Also included are a glossary of terms and a comprehensive bibliography. 

The most effective way to use this book is to read it from beginning to end and 
to review the sample test items (Appendix D) for the appropriate curriculum area. 
Although How to Write Tests for Students is structured sequentially, some readcr.5 
may choose to skim certain chapters and concentrate on others. In either case, the 
publication will be useful to all middle and secondary school teachers and adminis- 
trators, as well as to those preparing to enter the teaching profession. 



Chapter 1 

PLANNING TESTS 



Classroom teachers usually assess student achievement at the end of an instruc- 
tional unit. Developing a test that accurately measures achievement requires careful 
planning. This planning includes an examination of student behaviors previously 
identified in the instructional objectives. 

INSTRUCTIONAL OBJECTIVES 

Classroom teachers are responsible for writing objectives that describe the beha- 
viors expected of students. An objective is a communication device that specifies 
the knowkuje, skills, and attitudes expected of students at the end of an instruc- 
tional unit. Teachers should write and use objectives because they (1) identify ex- 
pected outcomes for students, and provide tnformarion for administrators and par- 
ents as well; (2) form a basis for sequencing instrucrional content; and (3) establish 
a basis for evaluating student achievement. Some teachers write objectives with 
three components: (1) conditions, (2) performance, and (3) criteria. Others write 
objectives that explicidy state only the performance expeaed of students; they do 
not include a condition or criterion. 

A condition identifies important resources or restrictions. 

A performance specifies an observable behavior or competency. 

A criterion specifies a standard for satisfactory performance. 

^ ^ " 1 

EXAMPLE 1.1 

Objective Format 

Given (specify condition ), the student will (insert performance state- 
ment ) with performance satisfactory it (specify criterion ). 

Objective (containing condition^ performance^ and criterion) 

Given a plant taxonomy , the student will identify at least five of seven 
leaves collected during a class field tnp. 

Key: condition performance criterion 



EXAMPLE 1.2 

Objective (with performance only) 

The student will solve equations with one unknown. 



Objectives are classified in the three learning domains: affecrive, cognidvr, and 
psychomotor. Each domain has levels or subdivisions that require increasingly more 
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complex student behaviors. Appendix A provides a complete desaiptioo of the hi- 
erarchical levels for all three learning domains. 

Affective objectives emphasize feelings, emotions, attitudes, and values. The 
major levels (from lowest to highest) are receiving, responding, valuing, organiza- 
tion, and characterization by a value complex. 



EXAMPLE 1.3 

Objective {affective domain) 

The student will establish cooperative working relationships with 
peers. 



Cognitive objectives emphasize intellectual tasks. The cognitive domain includes 
the largest proportion of educational objectives. These vary from simple recall of 
material to synthesizing and evaluating new ideas. The major levels (from lowest to 
highest) are knowledge, comprehension, application, ar lysis, synthesis, and 
evaluation. 



EXAMPLE 1.4 

Objective (cognitive domain) 

The student will write an essay on the pros and cons of capital 
punishment. 



Psychomotor objectives emphasize skill development and require neuromuscular 
coordination. Most objectives in this domain are used in physical education or trade 
and technical areas. The major levels (from lowest to highest) are perception, set, 
guided response, mechanism, complex overt response, adaptation, and origination. 



EXAMPLE 1.5 
Objective (psychomotor domain) 
The student will weld ferrous and nonferrous metals. 



Since most instructional objectives are in the cognitive domain, it is imperative 
that teachers understand the levels in this domain in order to develop appropriate 
test items. 

Knowledge objectives require students to remember facts, principles, terms, cus- 
toms, traditions, categories, rules, formulas, procedures, and criteria. 



I EXAMPLE 1.6 

I Objective (cognitive domain— knowledge level) 
I The student will match artists with paintings. 
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Comprehension objectives require students to describe, illustrate, summarize, in- 
terpret, or note consequences of ideas or methods. 



I 

EXAMPLE i.7 

Objective (cognitive domain— comprehension level) 
The student will translate Spanish into English. 



Application objectives require students to apply previously learned ideas, meth- 
ods, or procedures in a setting that is different from the original learning 
experience. 



EXAMPLE 1.8 

Objective (cognitive domain— application level) 

The student will calculate the perimeter of a nine-sided figure. 



Analysis objectives require students to separate ideas into constituent parts in or- 
der to note relationships between the parts and the whole. 



EXAMPLE 1.9 

Objective (cognitive domain— analysis level) 

The student will compare the responsibilities of the three branches of 
government of the United States. 



Synthesis objectives require students to formulate new thoughts from previously 
learned material. 



EXAMPLE 1.10 

Objective (cognitive domain— synthesis level) 

The student will develop a procedure for testing hypotheses. 



Evaluation objectives require students to judge (assess) the value of ideas, meth- 
ods, or procedures using appropriate criteria. 



EXAMPLE 1.11 

Objective (cognitive domain— evaluation level) 

The student will critique two oral speeches presented in class. 
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Appendix B contains lists of sample verbs u^ed to identify specific student beha- 
viors and verbs used in various curriculum areas. These lists should be helpftil to 
teachers when writing objectives that describe observable behaviors in the three 
teaming domaias. 



Test items should directly relate to specific behaviors identified in the instruc- 
tional objectives. They should be written to reflect the specific domains and levels 
identified by the objectives. For example, if the objective required the students to 
synthesi2e, a test item that required only comprehension would be inappropriate. 
Synthesizing and comprehending do not require the same behavioral performance; 
consequently the objective and test item would not correspond. Test items should 
also assess the student's ability in a particular domain. For example, if the objective 
required the student to repair a ten-speed bicycle, a test item that required the stu- 
dent to list the repairs would be inappropriate. Repairing is a psychomotor activity, 
whereas listing is a cognitive activity at tne lowest level. 

All secondary curriculum areas place varying degrees of emphasis on instructional 
objectives in the three domains. Most teacher-wrinen objectives, however, are usu- 
ally classified in the three lowest levels of the cognitive domain. Attainment of ob- 
jectives in these levels is appropriately assessed by true-false, multiple-choice, and 
matching test items. Objectives in the three highest levels of the cognitive domain 
are usually assessed by essay test items or more complex choice-type items. 

Knowledge test items require recall or recognition of information in substantially 
the same form as the instruction. Rephrasing, inversion of sentences, and similar 
form changes do not remove a test item £tom this level. 

Comprehension test items require interpretation of a principle in a maimer that 
differs fiom the instruction, but that implies the principle. 

Application test items require use of a previously learned principle in a new set- 
ting. These items differ from comprehension items in that tiiey do uot imply the 
required principle. Examples cf such items are quantitative problems in mathemat- 
ics and science. 

Higher-level test items in the cognitive domain require analysis, synthesis, or 
evaluation of complex situations with appropriate generalizations or inferences. 



EXAMPLE 1.12 

Objective (cognitive domain— knowledge level) 

The student will match chemical elements with abbreviations. 




Matching Test Items 

Match the chemical elements found in column I with the abbrevia- 
tions from column II. 



TEST ITEMS 



I 



n 



C 1. Copper 
i 2. Gold 
3. Mercury 



-iL 4. Silver 
^ 5. Tin 



B. Au 

C. Cu 

D. Hg 

E. Me 

F. Si 

G. Sn 



A. Ag 
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EXAMPLE 1.13 

Objective (cognitive domain— comprehension level) 

The student will identify the don: inant **tone" of a poem. 

Multiple-Choice Test Item 

The tone of Cununings' poem Among Crumbling People is 

A. jaunty and bantering. 

B. sarcastic. 

C. siUy. 
*D. sorrowful. 



EXAMPLE 1.14 

Objective (cognitive donmn— application level) 

The student will calculate the area of a circle given the circumference. 

Multiple-Choice Test Item 

If the circumference of a circle is 44 centimeters, the area in centime- 
ters squared iz 
A. 78 
♦B. 154 

C. 308 

D. 1.078 



EXAMPLE 1.15 

Objective (cognitive domain—analysis level) 

The student will compare and contrast research designs. 

Essay Test Item 

Compare snd contrast experimental research, quasi-experimental 
resc«a wh, \ ex post aao research designs in terms of 

• methodology, 

• appropriate analysis techniques, and 

• interpretation and generalization of results. 



The final step in test planning is to determine the appropriate number of test 
items to write for each level of the cogniuve domain. This rrquires a careful analy* 
sis and dassificarion of the instmctional objectives. If. for example, the analysis re* 
veals that 50 pevcent of the objectives are classified as knowledge, then 50 percent 
of the test items should assess recall of the facts and principles uught in the in* 
stmctional unit. If th ' analysis indicates that the remaining objectives are classified 
as 25 percent comprehension. 15 percent application, and 10 percent higher levels, 
then appropriate test items should be written to reflect this iastrucrional emphasis. 
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REVIEW QUESTIONS AND AcllVliltS 
Ques/iom 

1. What are the three compoaents of an objective? 



2. What arc the subdivisions of the cognitive domain? 



3. What are the subdi^is^ns of the affective don:ain? 



4. What are the subdivisions of the psychomotor domain? 



5. What domain includes the largest proportion of educational objectives? 



6. What kind of test items (e.g., true-false, multiple-choice, matching, essay) are 
used to ass«*ss the highest levels of cognitive learning? 



7. What determines the appropriate number of items for a test in the cognitive 
domain? 



1. Underline and label the condition, performance, and aiterion of the following 
student objective: 

In a field setting, students will weld ly pipe that will withstand a 
3,000 psi pressure test. 

2. Match each type of learning in column I with the most appropriate domain in 
column II. 



Activities 



I 



n 



1. 

2. 

— 3. 
4. 

— y 



G>mprehending ideas 



A. Affective 

B. G)gnitive 

C. Psychomotor 



Applying principles 
Carving wood 



Valuing quality work 
Evaluating concepts 
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3. Write an objective that focuses on stradents becoming punctual workers. 

4. Write an objective that requires a student to accomplish a high-level psychoiao- 
tor task. 

5. Develop a multiple-choice test item to measure an application-level cognitive 
concept. 

6. Develop an essay test item to measure a high-level cognitive concept. 
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Qiaptcf 2 

DEVELOPING TESTS 



The most frequently used teacher-made tests to assess student achievement are 
true-false, multiple-choice, laatching, and essay. Each of these test5 has distinct 
advantages and disadvantages. True-false, multiple-choice, and ir^'^^hing tests are 
relatively easy to write; they sample a large amount of content, ana ^e quickly and 
objectively scored. These tests are frequently limited to facts, they encourage guess- 
ing, and they usually do not measure higher levels of cognitive learning (analysis, 
synthesis, and evaluation). Thr major advantage of the essay test is that it requires 
students to demonstrate attainment of instructional objectives in the higher levels 
of the cognitive domain. In addition, the essay test allows students to express them- 
selves in their own words. Its rriajor disadvantage is the subjectivity associated with 
rating student responses. 

Several general and specific guidelines will help teachets when they write any 
type of classroom test. 



GENERAL GUIDELINES 

The general guidelines for developing classroom tests include the following: 

• Test items should be directly related to the instructional objectives. 

• Test items should be clear and free of ambiguities. 

• Test items should use vocabulary appropriate to the educational level of the 
students. 

• Test items should be grammatically correct and free from spelling and typing 
errors. 

• Test items should be realistic and practical; that is, they should call for infor- 
mation that students must use. 

• One test item should not be based on the response to another, nor should it 
provide a clue to oti)er items. 

• Test items should minimize or avoid the use of textbook or stereotyped 
language. 

• Test items should cite authorities for statements that mig^^c be considered de- 
bauble or based on opinion. 

• Test items should avoid complex sentences. 

• Test items should use the amplest method for recording responses. 

• Test directions should ^ clear and complete. 



SPECIFIC GUIDELINES 

In addition to general guidelines, specific guidelines should be used in develop- 
ing true-fsdse, multiple-rhoice, matching, and essay test items. These guidelines, 
coupled with examples, are designed to provide te«^chers with a sense of direction 
for constructing appropriate test items. 
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True-False Items 



The true-false test consists of statements that are either true or false. The student 
must read the statements and choose one of two alternatives: true or false, yes or 
no, right or wrong, or plus or minus (+ or The fidllowing are guidelines for 
constructing true*false test items: 

• Each test item should focus on a single important idea. 

• A test item should be false because it contains an important concept that is 
incorrect, not because it contains an insignificant error. 

• Each test item should be written as an affutnative rather than a negative 
statement. 



1 EXAMPLE 2.1 




1 Poor T F 


James Madison was not a President of the United States. 


1 Good T F 


James Madison was a President of tlie United Sates. 


• Test items should never use double negatives. 


EXAMPLE 2.2 




Poor T F 


Watergate was not an instance of unauthorized entry. 


Good T F 


Watergate was an instance of unauthorized entry 



• Test items should be brief, but not at the expense of clarity. Avoid sentences 
in excess of 20 words because they are guessed true more often than £dse« A 
lengthy statement containing dependent clauses and phrases reduces the prob* 
ability that an item is false. 

• Test items containing specific citadons or enumerations should be used with 
caution. Such statements are more likely to be true than false. 

• Test items should not use specific determiners, such as all^ always^ exactly^ 
never, totally, entirely, completely, solely, fully, absolutely, exclusively^ only^ 
none, nothing, and alone, because they are likely to be false. 

• Test items should be true or false without qualification. Statements contain- 
ing qualifiers, such as sometimes, maybe, often, several, as a rule, should^ 
may, most, some, and generally, are more likely to be true than false. 

• Test items that are tme should not be consistently longer than those that are 
false. 

• Tests should consist of approximately equal numbers of true and false 
statements. 



EXAMPLE 2.3 
Sample True-False Items 
JL 1. X = 8 in the equation 3X + 6 = 18. 
JL. 2. The area of a rectangle $ cm wide X 8 cm long is 40 cm^. 
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NLultiple-Choice Item 



Each item in a multiple-choice test consists of a stem and alternatives (a correct 
response and several distractors). The stem presents a statement or question; the 
correct response provides a solution or answer. The distractors are incorrect choices 
that should attract students who have not mastered the material, yet should not 
confuse students who have attained mastery. 

There are several guidelines for writing stems and alternatives. 

Guidelines for Developing Stems 

The guidelines for developing stems of multiple-choice items include the 
following: 

* Stems should be stated as briefly and concisely as possible. 

* Stems should be concerned with only one central problem. 

* Stems should be direct questions or incomplete statements. 

* Stems should be stated positively. 

* Stems should include as much of the item as possible so that students need 
not reread the same material in each alternative. 



EXAMPLE 2.4 

Poor If a gas is compressed 

*A. its temperature increases. 

B. its temperature decreases. 

C. its temperature remains the same. 

D. its temperature sometimes increases and sometimes 
decreases. 

Good Compressing a gas causes its temperature to 
*A. increase. 

B. deaease. 

C. remain the same. 

D. fluctuate between increase and deaease. 



• Stems should not use *'a'* or "an'* as the final word if it serves as a clue to 
the correct response. 



EXAMPLE 2.5 

Poor Oak is a material that is an 
*A. open-grain wood. 

B. closed-grain wood. 

C. softwood. 

D. hardboard. 

Good Oak is a material that is a/an 
♦A. open-grain wood. 

B. closed-grain wood. 

C. softwood. 

D. hardboard. 
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Guidelines for Developing Alternatives 

Multiple-choice item alternatives consist of three or four distractots (incorrect 
choices) and the correct response. The guidelines for developing these alternatives 
include the following: 

• Alternatives must include one choice that is clearly the best (correct) response; 
the remaining alternatives should appear plausible to the uninformed or par- 
tially informed student. An alternative should not be included solely for the 
purpose of humor. 

• Alternatives should be grammatically correct and consistent with the stem. 

• Alternatives should include the statements "none of the above'' or "all of the 
above" with caution. 

• Alternatives should use the appropriate punctuation marks. 

• Alternatives for each test item should be approximately equal in length. 

• Alternatives should be arranged so that the correa responses f^r all multiple- 
choice items occur in a random order. 

• Alternatives should be as brief as possible. 

• The format of the test should be consistent. 

^^^=^— =— — ^=^— — ™ 
EXAMPLE 2.6 
Sample Multiple-Choice Item 

1. The measure of central tendency computed when it is 
avoid the e£Fect of extremely high or low scores is the 

A. mean. 
*B. median. 

C. mode. 

D. variance. 

2. The probability of observing two heads and one tail i 
toss of three coins is 

A. 1/8. 

B. 2/8. 
*C. 3/8. 

D. 5/8. 




Matching Items 

There are times when matching test items are most appropriate for assessing stu- 
dent achievement in an instructional unit. Matching tests require students to re- 
spond by pairing statements or words in two columns of related material. The typi- 
cal matchmg exercise consists of a list of statements in column I (primary column) 
and a list of responses in column II (response column). 

Specific guidelines for developing matching test items include the following: 

• Directions should clearly describe the contents of columns I and II and the ba- 
sis for matching. 

• Matching exercises should consist of homogeneous material (for example, his- 
torical events and dates). 

• The entire matching exercise should be on the same page. 

• Responses in column II should be arranged in some systematic marmer: alpha- 
betically, chronologically, or, in the case of numerical responses, in ascending 
or descending order. 



G>liimn II should contain only one correct match for each statement in col- 
umn I. 

In most caseSf column II should contain more responses than needed for 
matching with the statements in column I. 

The statements in column I should normally be longer than the responses in 
column II because the statements should serve as stems and the responses as 
alternatives. This provides a more efficient visual arrangement for students. 



EXAMPLE 2.7 
Sample Matching Item 

Directions: Match the inventions found in column I with the inventors 
in column II. 







I 




n 


c 


1. 


Atlantic cable 


A. 


Colt 


I 


2. 


cotton gin 


E. 


Edison 


F 


3. 


electric staner 


C. 


Field 


E 


4. 


sewing machine 


D. 


Franklin 


H 


5. 


steam engine 


E. 


Howe 


G 


6. 


wireless telegraphy 


F. 


Kettering 








G. 


Marconi 








H. 


Watt 








I. 


Whitney 



Essay Items 

Essay test items usually consist of questions that require the student to demon- 
strate attainment of iastructionai objectives in the higher levels of the cognitive do- 
main. Responses should indicate the student's ability to organize facts and ideas in 
a clear and meaningful way. 

A helpful resource for teachers preparing essay test items is Teaching Writing in 
the Content Areas: Senior High School (by Stephen N. Tchudi and Joanne Yates 
[National Education Association, 1983]). Emphasizing the relationship between 
written expression and understanding of content, these authors recommend design- 
ing essay examinations that make it easier for students to produce good writing, as 
well as to "display their content knowledge as fully and clearly as they are able." 
To accomplish this, they suggest giving students several options— a dialogue, a sto- 
ry, an on-the-scene report, an interview, a scenario — lather than confining them to 
an essay to answer such examination questions. According to Tchudi and Yates, 
most content knowledge is expressed clearly and fully in different ways. 

To help students organize their &cts and ideas, especially when time is limited, 
teachers can remind students to apply what they know about the writing process to 
the examination. For example, they should divide their available time into three 
parts: ''a brief time prewriting (thinking and plaiming), a longer time for writ- 
ing, and a short time for revision and copyeciting." Tchudi and Yates also offer 
specific strategies for each part of the writing process. 

Guidelines for developing essay questions include the following: 

• Indicate a limit (space, words, time) for eadi item. 

• State items clearly and identify specifically what the student is to accomplish. 

• Develop a sample answer (before administering the test) that indicates the 
main points expected in the response. 
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EXAMPLE 2.8 



Sample Essay Item 

During this class period, write an essay that explains your position on 
the issue of capital punishment. Consider botli sides of the issue and 
provide pro and con arguments you consider relevant. 

Points for Grading 



Pro Arguments: 

• Society must protca itself against violent criminals. 5 

• Society must esublish deterrents to crime. 3 

• Death is a just punishment. 3 

Con Arguments: 

• Death penalrics are unjust. 3 

• Execution is not a morally defensible response to crime. 3 

• Capital punishment does not deter crime. 3 

• Two individual counters and/or cririques to relevant pro/con 
argiiments. 10 

• Individual's position. 10 

Total 30 



REVIEW QUESTIONS AND ACTIVITIES 
Questions 

]. What are the advantages and disadvantages of true-false, multiple-choice, and 
matching test items? 



2. What are the advantages and disadvantages of essay test items? 

3. What are specific guidelines for writing true-false test items? 

4. What are specific guidelines for writing multiple-choice test items? 
3. What are specific guidelines for wridng matching test items? 

6. What are specific guidelines for writing essay test items? 
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Activities 

1. Write a true-false test item 

2. Write a mulriple-choice test item. 

3. Write matching test items. 

4. Write an essay test item. 



Chapter 3 

ASSEMBLING AND ADMINISTERING TESTS 



After carefully preparing test items, it is important that teachers give similar 
attention to assembling the items, preparing test directions, and administering the 
test. Inattention to these factors may adversely affect the test results. 



ASSEMBLING TEST HEMS 

To create better tests, teachers should (1) review instructional objectives, (2) se- 
lect or write items that assess these objectives, and (3) properly arrange the items in 
a final test form. 

To improve existing tests, teachers should review the items, rewriting those that 
(1) are unclear, (2) have poor (implausible) distractors, (3) are too easy, (4) are too 
difficult, and (3) have technical and/or grammatical errors. One way to go about 
this process is to ask a colleague to review the test and supply information about in- 
dividual items as well as the test as a whole. This "teacher review method" can 
provide information about item ambiguity and other flaws that may have gone 
unnoticed. 

There are several guidelines to follow when assembling test items: 

• Arrange items by type (all matching together, etc.). This technique provides a 
consistent response mode for students. Directions should precede different 
item types. 

• Arrange items according to instructional content. This technique allows stu- 
dents to see relationships among test items. It also provides the teacher an op- 
portunity to compare test content with instructional objectives. 

• Arrange items in order of inaeasing difficulty. This arrangement may help re- 
lieve test anxiety and enable students to proceed in a timely manner to later 
items. No easy test items should be unanswered because of time limitations. 



PREPARING TEST DIRECTIONS 

A good test includes directions that explain how to respond to the item(s). Poor- 
ly written directions can mislead and confuse students. Directions should state 
clearly and concisely what, how, where, and when students should answer. Some 
tests (e.g., true-£dse and multiple-choice) require simple directions; ethers (e.g., 
matching and essay) may require more complex directions. 

The guidelines for developing test directions include the following: 

• Use a clear, succinct writing style. 

• Create direcrions that stand out by using different type style or size, underlin- 
ing type, or using bold type. 

• Check directions for misunderstandings or inconsistencies. 

• Have colleagues read for clarity and understanding. 

• Use a parallel format for all test item sections. 

• Provide a sample test item with the directions. 

ERIC - f> OQ 



EXAMPLE 3.1 

DIRECTIONS: Circle the letter for the best answer to each question. 



Sample How many days are in a week? 

A. 3 

B. 5 




1 . How many centimeters are 
in one meter? 

A. 10 

B. 12 

C. 100 

D. 1000 



2 . How many millimeters are 
in one meter? 

A. 10 

B. 12 

C. 100 

D. 1000 



Sample directions for use with true-false, multiple-choice, and matching test 
items for both answer sheet and test booklet responses follow. 



True-False Directions 

Answer Sheet: The following statements are either tme or false. If you believe 
the statement is true, darken the fiist space on the answer sheet that corresponds to 
the item number with a #2 pencil. If you believe the statement is false, darken the 
second space on the answer sheet that corresponds to the item number. 



EXAMPLE 3.2 

1. Copper is a conductor of heat and electricity. 

Answer Sheet 

T F 

1. A B C D E 

— ------ ------ ------ ------ 



Test Booklet: The following statements ate tme ot false. If you believe the sme- 
ment is true, write T in the blank preceding the item number. If you believe the 
statement is false, write F in the blank preceding the item number. 



EXAMPLE 3.3 
JL 1. Copper is a conductor of heat and electricity. 
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Multiple-Choice Directions 



Answer Sheet: The following items include a statement or question with four al- 
ternatives, only one of which is correct. Indicate your selection of the alternative 
that correctly completes the item or answers the question by darkening the space 
below the corresponding letter on the answer sheet with a ttl pencil. 



EXAMPLE 3.4 

2. The ability to do work or perform some action is 

A. energy. 

B. kilowatts. 

C. power. 

D. wave motion. 



Answer Sheet 
2. A B 



Test Booklet: The following items include a statement or question with four al- 
ternatives, only one of which is correct. Indicate your selection of the alternative 
that correctly completes the item or answers the 4uestion by placing the corrc* 
sponding letter in the blank to the left of the item number. 



EXAMPLE 3.5 

J!L 2. The ability to do work or perform some action is 

A. energy. 

B. kilowatts. 

C. power. 

D. wave motion. 



Matching Directions 

Answer Sheet: The following matching items (in Example 3.6) have two col- 
umns. Column 1 describes four measurement instruments. Column II lists five tests 
or inventories. Match each description in column I with the test or inventory in col- 
umn II by darkening the space below the corresponding letter on the answer sheet 
with a #2 pencil. 

Test Booklet: The following matching items (in Example 3.7) have two columns. 
Column I describes four measurement instruments. Column II lists fivw :est5 or 
inventories. Match each description in column I with the test or inventory in col- 
umn II by placing the appropriate letter in the blank to the left of the item 
number. 
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EXAMPLE 3.6 






I 




n 


3. Provides a measure of personal 
likes and dislikes. 

4. Measures potential ability or 
capacity to learn various skills and 
acquire new knowledge. 


A. 
n 

c, 

D. 


Achievement test 
Apuiuae test 
Attitude measure 
General mental ability 


5. Measures amount of knowledge or 
skills child has acquired in partic- 
ular subject field. 


E. 


test 

Interest inventory 


6. Measures global capacity to act 
purposefully, think logically, and 
deal with enviroimient. 






Answer Sheet 






A B C D E 












4. r-- _i 






5 _ 






6. _ 

=^=^===^== 







EXAMPLE 3.7 



Sl. 3. Provides a measure of personal 
likes and dislikes. 

-i- 4. Measures potential ability or 
capacity to leam various skills 
and acquire new knowledge. 

Measures amount of knowl- 
edge or skills child has ac- 
quired in particular subject 
field. 

6. Measures global capacity to act 
purposefully, think logically, 
and deal with environment. 



II 



A. Achievement test 

B. Aptitude test 

C. Attitude measure 

D. General mental ability 
test 

E. Interest inventory 
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ADMINISmJNG TESTS 



When administering tests, teachers must provide all students with the opportu- 
nity to demonstrate attaiiiment of the instructional objectives. TherdForc, a careful 
consideration of the physical setting and the psychological £sutors that may affect 
students' test results is important. 

Physical Setting 

Since most tests are administered in classrooms, it is important that teachers at- 
tempt to provide an atmosphere conducive to successful test taking. Id«d testing 
conditions include the following: 

• Ample student space 

• Proprr lighting and ventilation 

• Comforuble temperature and reasonable air circulation. 

Some classroom conditions arc beyond the teacher's control; others, however, 
can be modified by a sensitive teacher. For example, if the afternoon sun is creating 
a Flare for certain students, adjusting the window covering could eliminate the 
problem. In other words, teacher sensitivity to students' reactions can improve the 
physical setting. 

Attentiveness to distractions fi:om hallways, adjacent rooms, or outside the biilld- 
ing is also important. Although teachers caimot anticipate many of these distrac- 
tions, they can avoid most of them by carefully selecting the best time to adminis* 
ter the test. 

Psychological Conditions 

The psychological atmos; here associated with testing is often overlooked even 
though these hcxots can affect test results as much as, if not more than, the physi- 
cal conditions. If students feel tense, overanxious, or pressured, their performance 
during testing may be hindered. Teachers should be especially alert to these factors 
so that testing situations may lEiciliute accurate assessment. 

Teachers can create a positive testing atmosphere by explaining the reason for 
the test and adequately preparing students. They can best accomplish this by re- 
viewing the instructional content to be covered by the test and providing students 
with sample test items. The nature of the test and the amount of time students 
need for preparation will dictate the best time for this orienutii n. It may be a 
week or more before the test, or it may be immediately before the test. Keep in 
mind, however, that, even with careful preparation, some sti'dents will still be ap- 
prehensive because of the evaluative nature of testing. 



REVIEW QUESTIONS AND ACTIVmES 
Questions 

1. What are the guidelines for assembling test items? 



2. Why is it iir^portant to write clear test diivCtions? 
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3. What physical setting conditions must be considered before administering a 
test? 



4. What psychological conditions must be considered before administering a cest? 



Activities 

1. Write a set of directions for true-false, multiple-choice, matching, and essay test 
items. 



2. Describe how the teacher should prepare a classroom b.^ore administering a 
test. 
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Chapter 4 

INTERPRETING TEST RESULTS 



All teachers are involved in scoring tests and interpreting results. Objectively 
scored tests (true-fidse, multiple-choice, matching) use items with only one correct 
answer; they should yield the same results no matter who scores the test. Subjec- 
tively scored tests (short-answer, essay) use items with several possible answers; dif- 
ferent results often occur when two or more individuals score these tests. 

Test results should be provided to students as feedback as soon as possible after 
the testing situation. To interpret test results, teachers can use raw scores, frequency 
distributions, ranks, percentile ranks, measures of central tendency, mcp'^ures of 
dispersion, and z-scores and T-scores. 



RAW SCORES 

A raw score is the actual score a student receives on a tc^t. Raw scores are usually 
easy to obtain and provide quick feedback; however, they are difficult to interpret 
and can be misleading. For example, a raw score of 64 on an English test, if consid- 
ered in relation to a traditional grading scale (90-100 = A, 80-89 - B, 70-79 - C, 
60-69 = D, and $9 and below = F) and without additional informaticm, would be 
interpreted as a D grade. If, on the other hand, the highest possible score on the 
test was 66, a raw score of 64 would represent 97% or an A grade. 



FREQUENCY DISTRIBUTION 

A frequency distribution is a tally of scores for a particular test arranged from 
highest to lowest. This display offers a degree of comparability by indicating how 
well ovz student did in relation to other students who completed the same test. 
Teachers use frequency distribution to present interpretative informaticm to parents 
and administrators. 



RANKS 

Ranks indicate the position of a score in relation to other scotts. They are used to 
clarify the meaning of a raw score on a test. Scores are assigned successive ranks 
from 1 to N begiiming with the highest score. If two or mor^ students received the 
same score on a test, the rank assigned to each student is determined by dividing 
the sum of the tied ranks by the number of students who received the score. For ex- 
ample, if two students received the highest score of 98 on a test, both students 
would be ranked 1.3 (ranks 1+2 divided by 2 students = 1.3 rank for both). If 
three students received the highest score of 98 on a test, all three students would 
receive a ranking of 2 (ranks 1+2 + 3 divided by 3 students - 2 rank for all 
three). 
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EXAMPLE 4.1 

Frequency Distribution 

Test Scores Frequency Rank 

95 
94 
91 
90 
89 
86 
75 
73 
72 



(6 + 7 + 8dividcd by 3) 
(9+ 10 divided by 2) 



FERCENTDJB RANKS 

A percentile rank indicates the percentage of students in a class who obtained 
raw scores below a given score. For example, a percentile rank of 90 for a raw score 
of 85 indicates that 90 percent of the students in the group tested had raw scores 
lower than 85, and 10 percent had raw scores higher than 85. A percentile rank can 
ue calculated using the frequoicy or rank of the score in the distribution. 



FORMULA 

Percentile Rank (using frequency) 

^ number of test scores below score H- % at score ^ 
N (# of students who completed test) 



Percentile Rank (using rank) 

_ N (# of students who completed test) H- >5 - rank ^ 
N (# of students who completed test) 



MEASURFS OF CENTRAL TENDENCY 

Measures of central tendency include the mean, median, and mode. The mean is 
the arithmetic average of test scores. It is calculated by adding all the test scores in 
a distribution and dividing the sum of these scores by the number of students who 
took the test. The median is the middle test score in a list of scores from highest to 
lowest. It is the exact point in a distribution of ranked scores at which one-half or 
50 percent of the scores fail above and below. The mode is the most frequent score 
(the score made by the largest number of students) in a frequency distribution. In a 
bimodal distnbu Jon two scores tie for most frequent occurrence. In a trimodal dis- 
tribution three scores tie for most £requent occunence. Distributions should be con- 
sidered bimodal or trimodal only i£ the frequency of the tied scores is more 
than two. 
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EXAMPLE 4.2 



Calculate the percentile tank for test score 73. 



Test Scores 

95 
94 
91 
90 
89 
86 
75 
73 
72 



Frequency 



Rank 

1 
2 
3 
4 

5 

7 

9.5 
11 
12 



1 



PR (frequency) = 



number of test scores below 75 + % at 75 
N 



X 100 



PR = 



2-H 
12 



X 100 



PR = — X 100 =25 
12 



„ , , » N + .5 - rank 

PR(rank) = X 100 

N 



PR = 



12 -t- .5 - 9.5 
12 



X 100 



PR = — X 100 =25 
12 



FORMULA 






Mean 








Mean 


- 2X 




^ "IT 




X = 


Mean 




2X = 


Sum of test scores 




N = 


Number of students who took the test 




NonnaJ Distribution Bimodal Distribution Trimodai Distribution 
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EXAMPLE 4.3 

Calculate the mean for the following test scores: 

Test Scores 
X 

81 
80 

79 
75 
72 
71 
70 
69 
67 
_66 

2X = 730 



EXAMPLE 4.4 

Calculate the median and mode for the following test score distribution: 



Test Scores 


Frequency 


Tally 


96 


1 


I 


94 


3 


m 


91 


5 




90 


5 




87 


6 


JHTI 


85 


8 


wrm 


82 


5 




79 


2 


II 


77 


2 


u 


76 


2 


n 


74 


1 


I 





Median = Middle point in a distribution of test scores 
Median - Middle point in a distribution of40 test scores 
Median = 86 (middle point between test scores 83 and 87) 

•Mode » Test score made by the largest number of students 
Mode « 83 (test score made by 8 students) 
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FORMULA 
Standard Deviation 

Standard Deviation « SD = 

SD = Standard Deviation 
2x* = Sum of squared differences the test scores are from the mean 
N = Number of students who completed the test 




EXAMPLE 4.5 




1 


Compute the mean and standard deviation for the following test scores: 


Test Scores 






X 


X 




45 


5 


25 


44 


4 


16 P 


42 


2 


^ 1 


41 


1 


1 1 


39 


-1 


1 1 


38 


-2 


4 1 


38 


-2 


4 1 


37 


-3 


9 1 


J6 


-4 


— 1 


2X = 360 




2x» = 80 1 



Mean = X = ^ = ^ = 40.0 
N 9 


Standard deviation = SD =• yj~^ ~J 


^ y = ^8^ = 2.98 



A normal curve represents a normal or symmetrical distribution of scores. Distri- 
butions may also be skewed. A skewed distribution is an asymmetrical distribution 
with a majority of scores at one end. In a negatively skewed distribution the longer 
tail of the distribution extends toward the negative (left) end. In a positively 
skewed distribution the longer tail extends toward the positive (fight) end. Skew- 
ness is determined by comparing the mean and median of a distribution of scores. 
When the mean and median are equal, the distribution displays symmetry. When 
the median and mean arc not equal, the distribution is positively or negatively 
skewed. If class s'ze is more than 25 • most score distributions are negatively skewed 
for well-construrced and valid classroom unit tests. 
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MEASURES OF DISPERSION 



The most common measures of dispersion used by teachers are the range and the 
standard deviation. The range is the difference between the highest and lowest 
scores on a test. For example, if the highest score on a test is 90 and the lowest 
score on the same test is 49. the range is 41 (90 - 49 = 41). 

The range is used with measures of central tendency to describe the distribution 
of scores. If the distributions of test scores for two classes have different ranges, the 
test performance of the class with the larger range is more variable (more widely 
spread out along the score scale) than the class with the snudler range. For example, 
consider two groups of students who completed a 100-point test. The mean score 
for each class is 75; however, one class had test scores from 55 to 95 or a range of 
40, and the second class had test scores ftom 65 to 85 or a range of 20. The test 
performance of the first class is more variable tlian that of the second class. 

Standard deviation is a measure of dispersion that indicates how scores within a 
distribution deviate from the mean. In a normal distribudon (bell-shaped curve), 
68 percent of test scores lie between one standard deviation above the mean (34%) 
and one standard deviation below the mean (34%). Ninety-six percent (96%) of 
test scores lie between two standard deviations above the mean (48%) and two 
standard deviations below the mean (48%). Approximately 2 percent of the scores 
lie between two and three standard deviations above the mean and 2 percent of 
scores lie between two and three standard deviations below the mean. 

Numerous human characteristics follow a normal distribution. For example, 
adult intelligence (Wechsler Adult Intelligence Scale) is disaibuted normally with a 
mean intelligence quotient (IQ) of 100 and a standard deviation of 15. Approxi- 
mately 68 percent of people have IQs between 85 and 115 (between one standard 
deviation above and below the mean). Approximately 14 percent of individuals 
have IQs between 115 and 130 (between one and two standard deviations above 
the mean), and approximately 14 percent have IQs between 70 and 85 (between 
one and two standard deviations below the mean). An additional 2 percent of indi- 
viduals have IQs between 130 and 145 (between two and three standard deviations 
above the mean) and 2 percent have IQs between 55 and 70 (between two and 
three standard deviations below the mean). 





Negatively 
Skewed 



Normal 
Distribution 



Positively 
Skewed 



Z-SCORES AND T-SCORES 

A Z'Score represents the number of standard deviations of a raw score from the 
mean in a distribution. The mean and deviation must be calculated befere the 2- 
scote is computed. All z-score distributions have a mean of zero and a standard de- 
viation of one. Thus, a student with a z-score of +1.5 is IVi stanckrd deviations 
above the mean. Conversely, a student witii a z-score of - 1.5 is 1 V4 standard devi- 
ations below the mean. 

A T-score also represents the number of standard deviations of a raw score ftom 
the mean. Unlike z-scores, T-scores do not use decimals or negative numbers. They 
have a mean of 50 and a standard deviation of 10. Thus, a student with a T-score 
of 65 is 1 Vi standard deviations above the mean. Conversely, a student with a T- 
score of 35 is 1 Vi standard deviations below the mean. To convert a z-score to a T- 
score, multiply the z-score by 10 and add 50. For example, a z-score of +2.0 is the 
same as a T-score of 70 (2.0 X 10 - 20 + 50 - 70). Negative z-scorcs can also be 
converted to T-scores. For example, a z-sc?*"<! of -2.0 is the same as a T-score of 30 
( 2.0 X 10 -2C f 50 30). 
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EXAMPLE 4.6 
Calailate the z-scote and T-score for test score 67. 
Test Scores 

X X X2 

81 8 64 

80 7 49 

79 6 36 

75 2 4 

72 -1 1 

71 -2 4 

70 -3 9 

69 -4 16 

67 -6 36 

_66 -7 _^ 

ZX = 730 Zx* = 268 

^ 2;X 730 
Mean = X = — = - 73.0 

Standard deviatio . = SD = J ~ J ~ J ^^-S = 5.18 

Raw score - Mean 67 -73.0 -6.0 

z-score = - — ; — r:T — — — =» = = -1.16 

Standard Deviation 5.18 5.18 

T-scorc = 10 (z-score) + 50 = 10 (-1.16) + 50 = 38 
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REVIEW QUESTIONS AND ACUVmES 
Questions 

1. What is a taw score? 

2. What is a frequency distribution? 

3. What is a percentile rank? 

4. What is the mean? 

5. What is the median? 

6. What is the mode? 

7. What is standard deviation ? 

8. What is the difference between a normal and a skewed distribution of scores? 
9' What is the difference between a normal and a bimodal distribution of scores? 

10. What are z-scores and T-scores? 
Activities 

1 . Describe the advantages of reporting student performance using raw scores , 
ranks, and frequency distribution. 

2. Calculate the mean for the following test scores: 



Test Scores 



Frequency 



100 
93 
92 
87 
86 
81 
77 
73 
70 
67 
66 
63 



2 
3 
5 
3 
8 
3 
3 
2 
5 
2 
1 
1 
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3. Calculate the median, mode, and range for the following test scores: 

Test Scores Frequency 

96 1 

94 3 

91 5 

90 5 

87 6 

8) 8 

82 S 

79 2 

77 2 

76 2 

74 1 



4. Calculate the mean and standard deviation for the following test scores: 

Test Scores Frequency 

95 1 

94 1 

91 1 

90 1 

89 1 

86 3 

75 1 

73 1 

72 1 



5. Calculate the z-score and the T-score for test scores 73 and 95. 

Test Scores Frequency 

95 1 

94 1 

91 1 

90 1 

89 1 

86 3 

75 1 

73 1 

72 1 




6. Determine the rank and percentile rank for each of the following test scores: 
Test Scores Frequency 

85 1 
84 1 
80 2 
75 5 
70 2 
65 2 
60 1 

38 

36 



The fallowing are raw scores from a test administered to a class of 20 students. 
Organize these scores into a frequency distribution and compute the 

a. mean« median, and mode; 

b. range and standard deviation; and 

c. 2-score and T score for each test score. 







Test Scores 






98 


81 


92 


87 


86 


86 


91 


71 


86 


75 


63 


72 


52 


86 


87 


77 


59 


91 


97 


82 



The following are raw scores from a test administered to a class of 30 students. 
Organize these scores into a frequency distribution and compute the 

a. mean, median, and mode; 

b. range and standard deviation; and 

c. 2-score and T-score for each test score. 







Test Scores 






33 


20 


28 


25 


21 


27 


14 


15 


36 


07 


24 


34 


25 


07 


17 


28 


21 


17 


21 


26 


40 


37 


09 


26 


22 


29 


17 


10 


42 


12 



37 
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Qiapter 5 

ASSESSING TEST ITEMS 



After scoring and returning a test, most teachers allow tirae for students to ask 
questions about specific items. This procedure provides the teacher with feedback 
about the entire test (directions, items, and scoring procedures), and it can also 
identify items that are ambiguous or otherwise flawed. Unfortunately, this informal 
feedback is often the only analysis teachers make before administering the test 
again. The entire test is more systemaucally appraised by using three t«:hniques: 
analyzing an item response profile, calculating item difficulty, and determining 
item discrimination. 



RESPONSE PROFIIE 

Determining the appropriateness or effectiveness of each test item requires an 
analysis of student responses. A response profile provides an index of the frequency 
of student responses to test alternatives. It is particularly valuable in identifying 
how well distractors work in multiple-choice items. 

Suppose a class recently completed a test and, in addition to student feedback, 
the teacher desires more detailed information about each item. Examples $.1 
through $.3 provide sample test items and response profiles for a class of 2$ stu- 
dents. These examples illustrate the use of a response profile to determine how well 
each alternative functions in a muluple-choice item. 



EXAMPLE $.1 
Test Item 

If the odds in £sLvor of an event occurring are 6 to 1, the probability of 
this event occuning is 



A. 
B. 
♦C. 
D. 



1/7 
1/6 
6/7 
1/13 



Response Profile 

A B ♦€ D 
4 6 n 0 



(Alternatives) 

(# of student responses) 



The response profile for the item in Example $.1 shows that no students re- 
sponded to alternative D. This should immediately indicate to the teacher that al- 
temative D needs revision if it is to be retained. 
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EXAMPLE 5.2 
Test hem 

On a true-false test of N itenx, the test score (A) is coiripdted as the 
number right (R) minus the number wrong (W). If Joe responds to all 
th items and R are right, what score is recorded for Joe? (A, N, R) 

*A. A = 2R - N 

B. A = R - (R - N) 

C. A =-- R/N 

D. A = R - N 



L 



Response Profile 

*A B C D (Alternatives) 
8 !} 6 6 (# of student responses) 



The response profile for the item in Example J.2 indicates that the item is faulty 
since almost equal nmnbets of students r^ponded to the correct answer and the 
three distractots This item should be examined for complexity as well as for appro* 
priate relationship to the instructional objective(s). 
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EX. MPLE 5.3 
Test Item 

If you are involved in an automobile accident and an injur/ occurs, 
the first place you should call is the 

A. automobile repair center. 

*B. hospitol. 

C. insurance company oflfice. 

D. police St on. 



1 Response Profile 




A ♦B C D 


(Alternatives) 


1 2 8 3 12 


{ft of ident responses) 



Distractor D in example 3.3 should be carefully examined, smce more students 
responded to it than to the keyed (*) ;Jtemative. 



rnEMDIFHCULTY 

Ue n (fifp.culty indicates the proportion of students who responded correctly to a 
test item. The difficdty of an iten^ is expressed on a scale from 0.00 to 1.00. A val- 
ue of 0.00 indicates that no students responded correctly; a value of 1.00 indicates 
that all students responded correctly. For example, a oifficulty index of .70 indi- 
cates that 70 percent of the students responded correctly to a paritcular item. Most 
test developers recommend a .30 to .70 difficulty range with an average item d^- 
culty of .30. Test items with difHculty indexes above .70 are easy and those with 
difficulty indexes below .50 sire diBicult. 




Item difficulty is calculated by the following formula: 



FORMULA 



1 ^ <>f Students respondmg correctly to test item 

Item difficulty = — ^— 

it of students responding to test item 



Examples 3.4 and 3.3 provide sample test items with a response profile and an 
item difficulty calculation tor a class of 23 students. 



EXAMPLE 3.4 
Tesfhem 

The design principle that stresses the quality of equilibrium is 

*A. balance. 

B. proportion. 

C. unity. 

D. variety. 



Response Profile 

♦A B C D (Alternatives) 
18 3 2 2 (If of student responses) 



hem Difficulty 

Item difficulty = ^ = .72 



Example 3.4 identifies a relatively easy item, since 72 percent of the students re- 
sponded correctly item difficulty of .72 indicates that apptoximately three- 
fourths of die class demonsuated attairunent of the instmctional objective assessed 
by the item. 
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EXAMPLE :).5 
Te^t Item 

A symbol in poetry 

"^A. stands for something else. 

B. is a simile or metaphor. 

C. is easy to understand. 

D. appeals to the ear. 



0 Response Profile 

1 ♦A B C D 
1 8 3 6 6 


(Alternatives) 

(# of student responses) 


Item Difficulty 




Item diflElculty » 





Example 5.5 shows an item diflGculty of .32, indicating a diflGcult item. Accord- 
ing to the response profile, students responded randomly. Ideally, a test should sel- 
dom contain items with dij65culties below .30. 



ITEM DISCRIMINATION 

In addition to analyzing the item response profile and calculating the item difi5- 
culty, teachers can use a third technique to assess test items. Item discriminatio n 
provides an index of how an item discriminates between h^gh- and low-scoring 
students. 

Test items can indicate positive, negative, or no discrimination, expressed on a 
scale from +1.00 to —1.00. Positive discrimination occurs when more high- than 
low-scoring students respond correctly to a test item. Maximum positive discrimina- 
tioi occurs when all high-scoring students respond correctly to the item and no 
low-scoring students succeed on it. The discrimination for such an item would be 
+ 1.00. Negative discrimination occurs when more low- than high-scoring students 
respc^d correctly to a test item. Such items often have serious problems and must 
be revised or discarded. Maximum ungative discriminatior xcurs when all low-scor- 
ing students respond correcdy to the item and no high-scoring students succeed on 
it. The discrimination for such an item would be -1.00. This is a rare situation. 
No discrimination occurs when an equal number of high- and low-scoring students 
respond correctly to a test item. The disaimination for such an item would be 
0.00. A test item with no discrimination ^ould be reviewed and in all likelihood 
rewritten because it dors not differentiate between the two groups. 

To determine item discrimination, follow these steps: 

^ Arrange test scores in order ftom highest to lowest. 

^ Determine the number of students (K) to use in the hi^- and low-scoring 
groups. The number of students who completed the test will determine the 
percentage of the class to use for K. The'foUowing percentages are recom- 
mended: K = 50% if 20 or fewer students completed Ac test, K = 33% if 21 
to i9 students completed the test, and K = 25% if 40 or more students com- 
pleted the test. 
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• Detetmine the number of students from the high-scoring group who respond- 
ed correctly to the test item (H). 

• Determine the number of students from the low-scoring group who respond- 
ed correctly to the test item (L). 

Item discrimination is calculated by the following formula: 



FORMULA 






H-L 


Item disaimmauon = — - — 

Iv 


K = 


$0% of the class if 20 or fewer students completed the test 


K = 


33% of the class if 21 to 39 students completed the test 


K = 


2$ % of the class if 40 or mote students completed the test 


H = 


# of high'Scoring students responding correctly to the item 


L = 


# of low-scoring students responding correctly to the item 



EXAMPLE 3.6 

Assume a class of 27 students (K = 9). The nxmiber of students from the 
high- and low-scoring groups who responded correctly to each item is in- 
dicated. Cx)mpute the item discrimination indr!X for each of the follow- 
ing test items: 

Item High-scoring Low-scoring 

1. 8 2 

2. 3 6 

3. 4 4 



In Example 5.6, for a class cf 27, 9 students (K = 33% of 27 or K = 9) were 
identified in the high- and low-scoring groups. Iteii 1 shows that 8 students from 
the high-scoring gtoup (H = 8) and 2 students from the low-scoring group (L = 2) 
responded corrrctly. Item 1 disr^tminates positively, since more students from the 
high- than from the low-scoring group responded correctly. 



Item dis^ximination = 



h-L 
K 



Item discrimination = 



8-2 



= .67 



Item 2 shows that 3 students from the high-scoring group (H = 3) and 6 stu- 
dents fxom the low-scoring group (L = 6) responded correcdy. Item 2 discriminates 
negatively, since more students uom the low- than from the high-scoring group re- 
sponded correctly. 



Item discrimination 



3-6 



= -.33 
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Item 3 shows chat 4 students from the high-scoring group (H - 4) and 4 stu- 
dents from the low-scoring group (L - 4) responded correctly. Item 3 shows no dis- 
crimination, since an equal number of students from both groups responded 
correctly. 

1. . . . 4-4 
Item disaimmation = — - — = .00 

9 



1 EXAMPLE 5.7 

Assume a class of 20 students (K 10) and compute the item discrimi- 
nation index for the following test item. Then desaibe what, if any, ac- 
tion should be taken to revise the item indicates correct response). 

Test Item 

Group Response Profile Disaimination Index 

♦A B C D 

High 10 0 0 0 10 -3 

Low 3 2 2 3 10 ^ 

Both 13 2 2 3 

M Action to be taken: This is a good item that discriminates positively. No 
I action is necessary. 



In example 3.7, for a class of 20, 10 students (K - 30% of 20 or K - 10) were 
identified in the high- and low-scoring groups. The item shows that 10 students 
from the high-scoring group (H ~ 10) and 3 students from the low-scoring group 
(L^ 3) responded correctly. This item discriminates positively, since more high- 
scoring than low-scoring students responded correctly. 



EXAMPLE 5.8 

Assume a class of 30 students (K - 10) and compute the item discrimi- 
nation index for the following test item. Then describe what, if any, ac- 
tion should be taken to revise the item (* indicates correct response). 

Test Item 

Group Response Profile Discrimination Index 

A *B C D 

High 8 10 1 1 - 4 

Low 3 4 0 3 10 ^ 

Both 11 3 0 4 



Action to be taken: This item is negatively discriminating (more stud^^nts 
fi:om the low- than from the high-scoring group responded correctly to 
it). More high-Scoring students responded to distractor A than to the cor- 
rect alternative. In addition, no high- or low-scoring students responded 
to alternative C. Rewrite or discard this item. 
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In example 5.8. for a class of 30. 10 students (K = 33% of 30 or K = 10) were 
identified in the high- and low-scoring groups. The middle 33 percent (10 stu- 
dents) were not used in calculating item discrimination. The item shows that 1 stu- 
dent from the high-scoring group (H 1) and 4 students from the low-scoring 
group (L =s 4) responded correctly). This item discriminates negatively, since more 
low-scoring than high-scoring students responded correctly. 



EXAMPLE 5.9 

Assume a class of 40 students (K = 10) and compute the item discrimi- 
nation index for the following test item. Then desaibe what, if any, ac- 
tion should be taken to revise the item (* indicates coriea response). 

Test Item 

Group Response Profile Discrimination Index 

A B C ♦D 

High 2 1 0 7 7 - 7 ^ 

Low 2 0 17 10 " 

Both 4 1 1 14 

Action to be taken: Hiis item has no discrimination (an equal nimiber of 
students from the low- and high-scoring groups responded correctly). Re- 
write or discard this item. 



In example 5*9» for a class of 40, 10 students 7^ = 25% of 40 or K 10) were 
identified in the high- and low-scoring groups. The middle 50 percent (20 stu- 
dents) were cot used in calculating item discrimination. The item shows that 7 stu- 
dents from the high-scoring group (H - 7) and 7 students from the low-scoring 
group (L - 7) responded correctly. This item has no discrimination since an equal 
number of students from the high- and low-scoring groups responded correctly. 

Many schools have computer software programs that analyze test items and pro- 
vide response profiles, and also calculate i\em difiEiculty and disaimination indexes. 
See Appendix C for a listing of software resources. Different computer programs 
use different percentages of students in high- and low-scoring groups to calculate 
an item discrimination index. The percentages suggested in this publication are 
based on class size (20 or fewer students = 50%, 21 to 39 stu'lents 33%, 40 or 
more students » 25%). 

Afrer scoring a test, teachers must analyze test items by reviewing response pro- 
fries and determining item difficulty and discrimination by hand or by using a 
computer pro^nun. Information about item difficulty and discrimination will assist 
teachers in i^nting better test items that will ultimately result i^ " more accurate as- 
sessment of student achievement. 
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Assume a class of 40 students (K = 10) and compute the item discrimi 
nation index for the following test item. Then describe what, if any, ac 
tion should be taken xo revise the item (* indicates correct response). 



Test Item 

The design principle that stresses the use of contrasting elements so 
controlled and placed as to hold and retain attention is 

A. balance. 

B. proportion. 

C. unity. 
♦D. variety. 

Response Profile 

A B C *D (Alternatives) 

4 10 5 (# of H responses) 

2 7 0 1 (# of L responses) 

Item Discrimination 

J. . . . 5-1 4 

Item disamimation - = = .40 

10 10 



Action to be taken: For a class of 40, 10 students (K = 25% of 40 or 
K = 10) were identified in the high- and low-scoring groups. The re- 
sponse profile reveals that the correct choice is alternative D and the item 
discrimination is positive, since 5 high-scoring students and 1 low-scoring 
student responded correctly. Further examination ^ows alternative A to 
be a poor distractor, since it attracted more high- than low-scoring stu- 
dents. Alternative C is ineffective, since it attraaed no students fiom ei- 
ther group. Alternative B is an effective distraaor because it attraaed 
more low- than high-scoring students. Replace alternative C and revise 
alternative A. 
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EXAMPLE 3.11 

Assume a class of 40 students completed the following test items. Inter 
pret the item difficulty and discrimination indexes for each item. 



Item Item Item 

Number Difficulty Discrimination 

1 .60 .30 

2 .40 .00 

3 .20 .30 

4 .10 -.10 

5 .90 .10 



Difficulty of items is interpreted as follows: 

• Items 1 and 2: Moderate difficulty (.30 to .70) 

• Items 3 and 4: High difficulty (.30 or less) 

• Item y Low difficulty (.70 or greater). 

Discrimination for items is interpreted as follows: 

• Item 1: High discrimination (.40 or higher) 

• Item 3: Moderate discrimination (.30 to .40) 

• Items 2 and 3: Very low discrimination (.20 or less) 

• Item 4: Negative discrimination (examine for revision). 



REVIEW QUESTIONS AND ACTIVITIES 
Questions 

1. What is a response profile? 

2. What is item difficulty? 

3. What is item discrimination? 
Activities 

1. Assume a class of 20 students and compute difficulty and discrimination index- 
es for the following three test items. ITicn describe what, if any, action should 
be taken to revise these items indicates correa response). 

Test Item Kl 

Group Response Profile Difficulty Index Discrimination Index 
A B C ♦D 

High 6 2 0 2 
Low 0 2 0 8 
Both 6 4 0 10 
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Anion to be taken for item #1: 



Test Item #2 

Group Response Profile Difficulty Index Discrimination Index 
A B *C D 

High 10 8 1 
Low 3 2 3 2 
Both 4 2 11 3 



Action to be taken for item #2: 



Test Item #5 

Group Response Profile Difficulty Index Discrimination Index 
*A B C D 

High 5 0 0 5 
Low 2 0 5 3 
Both 7 0 5 8 



Action to be taken for item ftl: 
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Chapter 6 

IMPROVING TEST VAUDITY AND REUABEJTY 



Classroom tescs should provide valid and reliable measures of student perfor 
mance. An understanding of the procedures used to determine validity and reli« 
ability is necessary to assess the test as a whole. 



VAUDITY 

Validity is the extent to which a test measures what it was intended to measure. 
It is the most important requisite of any test. Even if o^iier practical and technical 
considerations are satis£sictory, the test's quality is doubtful without supportive evi- 
dence of validity. 

The four major types of validity used with teacher-made tests are content, con- 
struct, concunent, and predictive. Each type requires a specific procedure and has a 
primary use. 



EXAMPLE 6.1 
Types of Test Validity 



Type 
Content 

Construa 

Concurrent 

Predictive 



Procedure 

Compares test items with 
instructional objectives 

Identifies underlying con- 
cepts measured by test 

Compares test with a simi- 
lar measure of present per- 
formance 



Primary use 
Assessment of content 

Assessment of adequacy 



Provision of substitute test 
for less convenient existing 
measure 



Compares test performance Selection and classification 
with future outcome of students 



Content Validity 

Content validity is the most common type of validation teachers use to ascertain 
if the test provides an accurate assessment of the instructional objectives. No em- 
pirical procedures are used to establish content validity. Rather, test items are indi- 
vidually analyzed and compared with the levels of behavior specified in the instmc- 
tional objectives. 

Teachers attempting to establish content validity should consult a reading level 
word list — a published list that indicates an average student's reading level tor a 
specific age or grade. Such a checking device can ensure that the vocabulary used in 
the test iter I is not an obstacle to students. 
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Construct Validity 



Construct validity identifies the psychological traits or underlying constructs of a 
tttt. Constructs are hypothetical qualities that are assumed to exist in order to ac- 
count for behavior in varying situations. Essentially, this type of validity asks the 
question, **What is the test actually measuring?" R)r example, in a test developed 
to measure a mentally retarded student's ability to use metric linear concepts (Ex- 
ample 6.2), the constructs might consist of (1) assigning and recognizing metric 
symbols, (2) assigning and recognizing metric values, (3) associative conceptualiza- 
tion of scale numbers to measurement lengths, (4) psychomotor ability to measure 
objects with a metric scale and conceptualize with length measurements, and (3) 
test-taking ability that would include the following: 

• Anentionspan 

• Familiarity with mechanics of test (following directions, etc.) 

• Test-wise ability (experience in test taking) 

• Transferability of answers to response sheet. 

j — 1 

EXAMPLE 6.2 
Factor Structure for Learning Metric Unear Concepts 



LEARNING METRIC LINEAR CONCEPTS 



Factor Stnictutc 



Assigning and 
Recognizing 
Sym bob 

1 



Assigning and 
Recognizing 
Vuucs 




Associative 
Conceptual 




Psychomotor 




Tcst-Taking 
Ability 


2 


3 


4 


5 




1 


1 


1 






Attention 
Span 




Mechanics 
of Test 




Test-Wise 
Ability 




Transfer- 
ability 



Miller, P. W. "The Effects of Selected Industnd Arts Activities on Educable Mentally Re- 
tardcd Students* Achievement and Retention of Metric Linear Concepts/* Doctoral disser- 
tation, The Ohio State University, 1977. Dissertation AbstmU J temational 38, 4627A. 
UnivcBlty Microfilms No. 7731934. 



Concurrent Validity 

Concureut vdidity compares a teacher-made test with a similar assessment mea- 
sire. A major reason for establishing concurrent validity is to substitute a test for a 
more time-consuming or more complex instrument. 

Predictive Validity 

Predictive validity compares tc:t performance with a future outcome-— for exam- 
ple, using ninth grade Scholastic Apritude Test scores to predict success in tenth 
grade high school chemistry. Both concurrent and predictive validity rely on corre- 
lation coefficients or an expectancy table to indicate the degree of relationship. 
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Suppose a teacher is interested in determining the usefuhiess of a mathematics 
aptitude test (administered at the beginning of the school year) to predia final test 
grades in Algebra I. To determine if the two measures are related, the teacher pre- 
pares an expectancy uble. 



EXAMPLE 6.3 
Expectancy Table 



Mathematics Aptitude Test Scores and 
Final Test Grades in Algebra I 

Mathematics Aptitude Test Scores 



Test 



2-5 


6-9 


10-13 


14-17 


18-21 


Grades 






1 


3 


3 


7A's 






10 


1 


4 


15 B's 




4 


6 


2 


1 


13 Cs 




4 


1 






5 D's 


1 


3 








4 Fs 


1 


11 


18 


6 


8 


44 



An examination of this expeaancy wble (Example 6.3) reveals that the grades on 
the final test are 7 A's, 15 B's, 13 Cs, 5 D's, and 4 Fs. Eight of the 44 students 
had the highest mathematics aptitude test scores (between 18 and 21); sbc students 
had aptitude test scores between 14 and 17. Of these 14 students receiving the 
highest scores, 11 (79 percent) received final test grades of A or B. All 14 students 
received final test grades of C or high . 

Further examination of the expectancy table bdicates that 7 of the 11 students 
(64 per^nt) with aptitude test scores between 6 and 9 had firud grades of D or F. 
In addition, 31 of 32 students (97 percent) with aptitude scores of 10 or greater had 
final test grades of A, B, or C. Analysis of the expectancy table indicates diat apti- 
tude test scores would be useful in predicting final test grades in Algebra L 



REUABIUTY 

Reliability provides an estimate of the a>nsistency of test results; it is expressed as 
a correlation coefficient reported on a scale ranging from 0.00 1.00. Next to va* 
lidity, reliability is the most important charaaeristic of a test. Without reliability, 
little confidence should be placed in test results. A test that is not valid, but highly 
reliable, may measure something inelevant with great precision. Teachers should 
not, however, expea test results to be perfectly consistent. As a test is administered 
to different students and /or groups, variations in test scores should be expected be* 
cause of £ictors other than quality (for example, fatigue, guessing). Generally 
however, if reliability is high, a second administration of a similar test to die same 
students should produce similar scores. If reliability is low, it is doubtfiil that two 
administrations of the same test would produce similar scores. 
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Reliability coefficients used with teacher-made tests are usually identified by the 
method used to calculate reliability. These methods are test-retest, equivalent 
forms, split-half, and Kuder-Richardson Formula 21. Each method follows specific 
procedures and provides evidence of consistency. 



EXAMPLE 6.4 

Types of Test Reliability 

Method Procedure 

Test-Retest Correlate scores for two ad- 
ministrations of same test. 
Second administration at 
later time (i.e., one month, 
one-half year). 

Equivalent Correlate scores for two 
Forms forms of test. 



Split-Half 



Kuder- 

Richsudson 
Formula 21 



Correlate scores for two 
halves of test. Apply correc- 
tion formula. 

Calculate mean and stan- 
dard deviation of test 
scores. Compute reliability 
coefficient. 



Consistency 

Stability of test results over 
tirr** 



Equivalency of forms 



Internal test consistency 



Internal test consistency 



Test'Retest 

Test-tetest is a method of establishing reliability by correlating scores from two 
administrations of the same test to the same group of students in z given time in- 
terval. The interval may vary fi:om a few days to several months, depending on the 
use of the results. If the results show the students' scores were approximately the 
same on both administrations of the test, then a positive relationship would exist 
(1.00 indicates a perfect positive relationship; 0.00 indicates no relationship). This 
method is seldom used to establish reliability for teacher-made tests because of the 
requirement of administering the test twice to the same students. 

Equivalent Forms 

Equivalent forms is a method of establishing reliability by correlating scores from 
two different, but equivalent, forms of a test. The two tests are administered to the 
same studentc and the scores are correlated. The correlation provides an estimate of 
how well both forms of the test the same instmctional objectives. This meth- 
od is seldom used to determine reliability of teacher-made tests because of the re- 
quirement of two forms of the same test and two testing sessions. 

Split'Half 

Split-half is a method of establishing reliability by correlating the scores from 
two halves of th^; same test. Usually the scores &om t^ie odd- and even-numbered 
test items are .inelated. The correlation coefficient provides a measure of inteinal 
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consistency and indicates the equivalency of the two halves in assessing the instmc- 
tional objectives. Finally, a correction formula (Spearman-Brown) is applied to the 
correlation coefficient, which esublishes a reliability coefficient for the entire test. 



FORMULA 




Spearman-Broum Formula 




Reliability of entire test 


_ 2 X Reliability on Vi test 
1 + Reliability on % test 



If, for example, the half-tests correlated .50, then the correlation coefficient of 
the whole test would be .67. 



EXAMPLE 6.5 



Reliability of entire test = 7^"^ 



1.00 
1.50 



= .67 



Kutkr-Richarebon formula 22 

The Kuder-Richardson Formula 21 (KR21) is a method of establishing reliability 
that requires a single administration c F a test and uses the mean and standard devi- 
ation of the test scores. 



FORMULA 

Kuder-Ricbardson Formula 21 (KR21) 

A / X (A - X) 



KR21 



A__ / X(A-X) \ 
-1 ^ A(SD)» f 



A (SD)» 

A = test length (# of items) 
X = mean of the test scores 
SD = standard deviation of the test scores 



If, for example, a distribution of test scores for a 50-item test has a mean of 30 
and a standard deviation of 6, the KR21 reliability is .68. 




EXAMPLE 6.6 
KR21 
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30 (50 - 30) 
50(6)» 



) 




The KR21 is easily determined if the mean and standard deviation of the test 
scores have previously betm calculated. This can be done with a hand-held calcula- 
tor or oiie of several statistic software prognuns used with microcomputers. 
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Improving Test Reliability 



Several test charaaeristics affect reliability. They include the following; 

• Test length. In g^«?ral, a longer test is more reliable than a shorter one be- 
aiuse longer tes^ sample the instructional objectives more adequately. Con- 
sidering this single foaor, teachers should strive for longer tests whenever cir- 
cumstances peratiit. 

• Spread of scores. The type of students taking the test can influence reliability. 
A group of students widi heterogeneous abiUty will produce a larger spread of 
test scores than will a group with homogeneous ability. 

• Item difficulty. In gencrU, tests composed of items of moderate difficulty (.30 
to .70) will have more influence on reliability than those composed mainly of 
easy or difficult item^. 

• Item discrimination. In general, tests composed of more discriminating items 
will have greater reliability than those com^ osed of less discriminating items. 

• lime limits. Adding a time £wtor may improve icliability for lower-level cog- 
nitive test items. Since ali students do not fimction at the same pace, a time 
^kctor adds another criterion to the test that causes discrimination, riius im- 
proving reliability. Teachers should not, however, arbitrarily impose a tim^ 
limit. For higher-level cognitive test items, the imposition of a time limit may 
-'tfcat the intended purpose of the items. 



REVIEW QUESTIONS AND ACnVITlES 
Questions 

1 . When several people correct and score a tc . with the same results, it is 

A. reliable. 

B. valid. 

C. comprehensive. 

D. objective. 

2. When a test measures what it is intended to measure, it is 

A. reliable. 

B. valid. 

C. comprehensive. 

D. objective. 

3. When a test is a consistent measure of student achievement, it is 

/a. reliable. 

B. valid. 

C. comprehensive. 

D. objective. 

4. Reliability is to vdidity as 

A. content is to construa. 

B. accuracy is to error. 

C. consistency is to truth. 

D. precision is to flexibility. 
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S. The story in which the size of the fish that was caught gets bigger and bigger is 
an example of good or bad 

A. reliability. 

B. validity. 

C. objectivity. 

D. comprehensiveness. 



Activities 

1. Desaibe the procedures used to demonstrate content validity. 



2. Describe the procedures used to demonstrate predictive validity. 



3. Describe the procedures used to demonstrate test-retest reliability. 



4. Describe the procedures used to demonstrate split-half reliability. 



3. Describe live test characteristics that affea reliability. 
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APPENDIXES 



57 

ERIC 



A. HIERARCHICAL LEVELS OF THE LEARNING DOMAINS 

Affective Domain^ 

1. Receiving 

Becomes aware; is willing to learn and try a particular response. 

2. Responding 

Initially may react out of compliance*, later out of willingness/satisfaction. 
(Requires Beceiving) 

3. Valuing 

The process of accepting the worth of an object, idea, or behavi ;i, attempting 
to promote it as a value; and developing commitment. 
(Requires Receiving and Responding) 

4. Organization 

Determining interrelationships of values; establishing a hierarchy. 
(Requires Receiving, Responding, and Valuing) 

5. Characterization by a Value Complex 

Generaliring selected values into controlling tendencies with subsequent inte- 
gration inco total »^hilosophy. 

(Requires Receiving, Responding, Vduing, and Organization) 

Cognitive Domain** 

1. Knowledge 

Ability to define terms, recall customs, traditions, trends, classifications, catego- 
ries, criteria, principles, generalizations. 

2. Comprehension 

Ability to explain, describe, illustrate, summarize, interpret, note consequences 
of ideas, procedures or methods. 
(Requires Knowledge) 

3. Application 

Ability to apply or use ideas, procedures, methods, 
(Requires Knowledge and Comprehension) 

4. Analysis 

Ability to break down, compaie, contrast, distinguish between, perceive, ana- 
lyze ideas into constituent parts. 
(Requires Knowledge, Comprehension, and Application) 

3. Synthesis 

Ability to put together, create, plan, develop, make, produce, reconunend, 
generalize, formulate a new idea, procedure or method. 
(Requires Knowledge, Comprehension, Application, and Analysis) 

6. Evaluation 

Ability to evaluate, assess the value of ideas, methods, using appropriate criteria. 
(Requires Knowledge, Comprehension, Application, Analysis, and Synthesis) 



•Krathwohl, D. R.; Bloom, B. S.; and Masia, B. B. Taxonomy of Educational Objectives: Handbook 
II Affective Domain, New York: David McKay, 1964. 

♦•Bloom, B. S.; Englchart, M. D.; Furst, E.J.; Hill, W. H.; uid Ktathwohl, D. R. Taxonomy of Edu- 
cational Objectives: Fandbook /, Cognitive Domain. New York: David McKay, 1956. 




Psychomotor Domain 

1. Perception 

Become aware through sense organs. Recognize cues, make choices, and relate 
to actions. 

2. Set 

Mental, physical, or emotional readiness. 
(Requires Perception) 

3. Guided Response 

Overt actions by limitation and/or trial and error under supervision. 
(Requires Perception and Set) 

4. Mechanism 
Habitual response. 

(Requires Perception, Set, and Guided Response) 

3. Complex Overt Response 

Action performed without hesitation, leading to automatic performance. 
(Requires Perception, Set, Guided Response, and MechaoL<?m) 

6. Adaptation 

Altering activities to fit new situations. 

(Requires Perception, Set, Guided Response, Mechanism, and Complex Overt 
Response) 

7. Origination 

Creating new actions to fi: a situation. 

(Requires Perceprion, Set, Guided Response, Mechanism, Complex Overt Re- 
sponse, and Adapurioa) 
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Simpson, E. ] "The Classification of Educationa' Objectives in the Psychomotor Domain." The Psy- 
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B. SAMPIEVmS FOR VmmC INSTRUCTIONAL OBJECTIVES 



Sample Verbs Used to Identify Specific Student Behaviors 



General Disciiminative Behaviors 

Qioose Describe Discriminate Indicate Match Place 

Collect Detect Distinguish Isolate Omit Point 

Define Differentiate Identify List Order Select 



Study Behaviors 



Arrange 


Classify 


Follow 


Look 


Organize 


Sort 


Categorize 


CompUe 


Formulate 


Map 


Quote 


Underline 


Chart 


Copy 


Gather 


Mark 


Record 




Cite 


Diagram 


Itemize 


Name 


Reproduce 




Circle 


Document 


Label 


Note 


Search 





Analysis Behaviors 

Analyze 
Appraise 
Combine 



Compare Criticize Evaluate Generate Plan 
Conclude Deduce Explain Induce Stnicture 
Contrast Defend Formulate Infer 



Creative Behaviors 

Alter 
Ask 

Change 
Design 



Generalize 
Paraphrase 
Predict 
Question 



Rearrange Rename 

Recombine Rco ier 

Reconstruct Rephrase 

Regroup Restate 



Restructure 

Retell 

Rewrite 



Simplify 

Synthesize 

Systematize 



Miscellaneous Behaviors 



Attempt 


Discover 


Giind 


Position 


Send 


Suggest 


Attend 


Disuibutc 


Hold 


Present 


Serve 


Supply 


Begin 


End 


Include 


Produce 


Sew 


Support 


Bring 


Eiase 


Inform 


Propose 


Share 


Switch 


Buy 


Expand 


Lead 


Provide 


Shatpen 


Take 


Complete 


Extend 


Lend 


Put 


Shorten 


Tear 


Consider 


Find 


Light 


Raise 


Shut 


Touch 


Correct 


Finish 


Make 


Relate 


Signify 


Type 


Crush 


Fit 


Mend 


Repeat 


Start 


Use 


Designate 


Fix 


Miss 


Return 


Store 


Vote 


Develop 


Get 


Offer 


Save 


Strike 





Claus, C. K. "Verbs and Imperative Sentences as a Basis for Stating Educational Objectives." Paper 
given at the meeting of the Nation?) G)uncil on Measurement In Education, Chicago, 1968. 
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Sample Verbs Used in Various Curriculum Areas 



Art Behaviors 

Assemble 

Guve 

Color 



G>nstnict 

Draw 

Fold 



lUustmte 

Melt 

Mix 



Mold 
Paint 
Paste 



Pour 
Press 
Roll 



Sculpt 
Sketch 
Trace 



Drama Behaviors 

Cross Display Exit 

Direct Enter Leave 



Move Pass Sit 

Pantomime Proceed Turn 



^^guage Behaviors 

Accent Hyphenate Pronounce 

Punauate 
Read 



Alphabetize Indent 
Edit Outline 



Speak Sununarize Verbalize 

Spell Syllabicate Whisper 

Sute Translate Write 



Laboratory Science Behaviors 

Calibrate Convert Dissect Increase 

Condua Decrease Feed Lengthen 

Connect Demonstrate Grow Limit 



Manipulate Report 
Prepare Specify 
Remove Weigh 



Mathematical Behaviors 

Add Divide GrapL 

Bisea Estimate Group 

Calculate Extrapolate Integrate 



Interpolate Prove Square 
Measure Reduce Tabulate 
Multiply Solve Verify 



Music Behaviors 

Blow 
Bow 
Qap 



Compose Hjm Piuck Sing Tap 

Finger Mute Practice Strum Whistle 

Harmonize Play 



Physical Behaviors 

Bend Hit 
Catch Hop 
Grasp Jump 



Kick Run 
March Skate 
Pitch Skip 



Somersault Swing 
Stretch Throw 
Swim Walk 



Social Behaviors 

Agree 

Answer 

Argue 



Compliment DissLgree Greet Join Participate 
Contribute Discuss Help Laugh Praise 

Cooperate Forgive Invite Meet Smile 



Claus, C. K. *'Verbs and Imperative Sentences as a Basis for Stating Educational Objeci^ves/* Paper 
given at the meeting of the National Council on Measurement in Education, Chicago, 1968. 
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C SEIMCTED COMPUTER SOFTWARE RESOURCES 



LRX Test (test-geoeration program) 

LcgK Extension Resources 

9i.'51 Business Center Drive, Suite C 

Rancho Cucamonga, CA 91730 

(714)980-0046 



MicroGrade (grade-keeping program) 
Chariot Software Group 
3659 India Street, Suite 100 
SanDiego,CA 92103 
1-800-242-746S 



MicroTest II (test-generation program) 
Chariot Software Group 
3659 India Street, Suite 100 
San Diego, CA 92103 
1-800-242-7468 



Score (test-scoring program) 
Flagstaff Engineering 
112'JKaibabLane 
Flagstaff, AZ 86001 
(602)779-3341 



Teacher's Aide (test-generation program) 

Jensen Enterprises, Inc. 

2979 Pershing Drive 

Coloma, MI 49038 

(616)849-2753 



Test Quest (test-generation program) 
Allen Communication 
5225 Wiley Post Way 
Salt Lake City, Ul' 841 16 
(801) 537-7800 
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D. SAMPLE TEST ITEMS 



This appendix prcsc.its sample test items from art, biology, chemistry, English, 
French, home economics, industrial education, mathematics, music, physical edu- 
cation, physics, science, social studies, and Spanish. These test items were contrib- 
uted by practicing teachers and organized by the authors to provide a consistent 
format. 



ART 

True-False Items 

-X. 1. Gauguin is best known for paintings of South Sea islanders. 

-L. 2. Complementary colors mixed together become brighter. 

JL 3. Degas painted and drew ballet dancers. 

^ 4. A 4-B drawing pencil has hard lead. 

JL 5. Toulouse-La Jtrec made posters to advertise entertainers. 



Midtipk-Choice Items 

1. An etching is an example of 

A. relief printing. 
*B. intaglio printing. 

C. lithographic printing. 

D. gravure printing. 

2. What artist is known for a series of paintings of the same subject in dififerent 
lights? 

A. Picasso 

B. Manet 
*C. Monet 

D. Rembrandt 

3- What artist made scient^^c drawings during the Renaissance period? 

A. Van Gogh 

B. Rubens 

C. Michelangelo 
♦D. daVind 

4. What artist is known for dripping paint on canvas? 
♦A. PoUock 

B. Cezanne 

C. Van Gogh 

D. Seurat 

3. Impressionists were a group of artists who usv ily 

*A. painted nature out-of-doors. 

B. painted still life in studios. 

C. made impressions from wood blocks. 

D. used ureams for object macter. 
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Matching Items 

T 

1. used for paste-ups 

Sl. 2. used for painting or drawing in color 

3. applied to canvas before oil painting 

4. mixed with oil paints 

3. board used to mix paints 



n 

A. turpentine 

B. water 

C. pastels 

D. palette 

E. rubber cement 

F. shellac 

G. gesso 



Essay Item 

Define line, volume, shape, texture, value, and color and describe how each de- 
sign element is used in a drawing. 

Points for grading 

• Design elements are defined (6 points). 

• Description is provided concerning how each design element is used in a 
drawing (6 points). 



Sample test items were prepared by Ray Schultz, fomier an teacher, Cape May County Vocational 
Schook, Cape May Coun House, NJ. 
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BIOLOGY 



True-False Items 



Plants ate capable of losing water by the process of dehydration through 
the vascular bundles. 

Diseases are changes fum the normal state that prevent specific organs 
from carrying out their functions. 

A substance that releases hydroxyl ions when dissolved in water or that 
can give off protons when reacting chemically is a base. 

During the dark stage of photosynthesis, the energy produced during the 
light phase is converted into sugar and oxygen through chemical reactions 
that do not involve light. 

A catalyst is a substance that stimulates the occurrence of a chemical reac- 
tion by lowering the activation tneigy requirement and that does not be- 
come a part of the final product. 



Multiple-Choice Items 

1. A basic determiner of heredity is 

A. amino acid. 
♦B. DNA. 

C. mutation. 

D. RNA. 

2. An example of an inherited gene mutation umans is 

A. exposure to radiation. 

B. replication of DNA. 
"^C. sickle-cell anemia. 

D. synthesis of proteins. 

0* A peptide bond forms betryeen 

*A. the carboxyl group of one amino acid and the amino group of another 
amino acid. 

B. the carboxyl groups of two amino acids. 

C. two amino acids with the addition of a water molecule. 

D. an amino acid and a carbohydrate with the loss of a water molecule. 

4. The probability of a family having a girl and two boys in that order is 

A. one-third. 

B. one-half. 

C. cne-sixth. 
♦D. one-eighth. 

The following diagram represents a trait for albinism. Individuals who are albi- 
nos are represented by filled-in circles. 
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m mile 
» female 



» $ $ ^T-O- 




3. From studying the preceding diagram, one can detennine that albinism 
*A. is a recessive trait. 

B. is a dominant uait. 

C. ail :cts only men. 

D. is passed on by only one parent. 

Matching Item 



Essctfltem 

Each year thousands of salmon return to the same freshwater spawning area 
where they had their beginning. How they know where to go is still a mystery, une 
suggestion is that hormones feund in the cells of the sahnon affea the existing 
osmotic conditions between the cells and the suitounding water. It is possible 
that the hormones secreted in the celk of the fish are synchronized with spawning. 
Discuss spawning migration in terms of existing osmotic conditions and ^e scac- 
tion of specific hormones. 

Points for grading 

• Relating answer to general comments about osmosis (2 points). 

• Relating osmosis to the conditions between salmon and sea water (3 



Sample test items were prepared by James Kelly, Malcolm Price Laboratoiy School, University of 
Northern Iowa, Cedar Falls. 



I 



n 



1. hemophilia 
— ~ 2. sickle-cell anemia and cystic fibrosis 
J5_ 3. Down's syndrome 



B. autosome recessive 

C. polyfactorial 

D. trisomy 

E. X-licked recessive gene 



A. autosome dominant 



points). 
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CHEMISTRY 
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True-False Items 

1. The chemical behaviors of sodium and cesium are similar. 

-f- 2. Since an atom is electrically neutrd, the number of protons is equal to 
the number of neutrons. 

3. The chloride ion consists of 17 protons, 18 neutrons, and 18 electrons. 

JL 4. The algebraic simi of the oxidation numbers of the atoms in the formula 
of a compound is zero. 

3. A covalent bond is formed in the diatomic hydrogen molecule. 

.JL 6. The oxidation number for magnesium and barium is 2+. 

7. Since an atom is electrically neutral, the number of electrons is equal to 
the number of neutrons. 

.-f. 8. The sodium ion consists of 11 protons, 11 electrons, and 12 neutrons. 

-J- 9- The oxidation number of chlorine in potassium chlorate (KCIO3) is 3+ . 

.JL 10. A double covalent bond is formed in diatomic oxygen. 

Multiple-Choke Items 

1. A vertical column of elements in the periodic uble is a/a' 

*A. family. 

B. octet. 

C. period. 

D. series. 

2. The element capable of replacing all others in tbr halogen family is 

A. bromine. 

B. chlorine. 
*C. fluorine. 

D. iodine. 

3. The oxidation number of phosphorous in the polyatomic ion P04^* is 

A. 3+ 
♦B. 5+ 

C. 2+ 

D. 4+ 

4. The number of moles of aluminum atoms reacting with three moles of oxygen 
molecules in the equation 4A1 + 3O2 2AI2 O3 is 

A. 1 

B. 2 

C. 3 
♦D.4 



3. The oumber of moles io ooe ki'. jgram of zioc is 
A. 14.1 
*B. 15.1 

C. 16.5 

D. 33.3 

6. A horizontal tow of elements in the petiodic table from sodium to argon is 
a/an 

A. family. 

B. octet. 
•C. scries. 

D. group. 

7. The oxidation number of sulfur in the polyatomic ion S04^~ is 

A. 2+ 

B. 4+ 
*C. 6+ 

D. 8+ 

8. The number of grams in 2.5 moles of carbon dioxide is 

A. 28 

B. 44 

C. 88 
*D. 110 

9. The number of moles of oxygen atoms reacting with four moles of aluminum 
atoms in the equation 4A1 + 3O2 — 2AI2 O3 is 

A. 2 

B. 4 
*C. 6 

D. 8 

10. The molarity of a solution containing 20.0 grams of sodium hydroxide dis- 
solved in 250.0 ml of water is 

*A. 2.0 
B. 1.0 

c. 0.5 

D. 0.25 



M^aching Items 



I 



1. a substance that does the dissolving 

2. two liquids that do not dissolve in each other 

3. a substance of low solubility 
_A_ 4. a method of purifying a substance 

5. the substance U:ing dissolved 



n 



A. distillation 

B. immiscible 

C. miscible 

D. precipitate 

E. soluble 

F. solute 

G. solvent 
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Essof Uem (Prollems) 

In the equation 2NaC103 2NaCl + 3O2, how many grams of NaClOj are 
required to produce one mole of O2? 

Points for grading 

• Answer: 70. 7g (3 points) 

In the equation 2HC1 + G1CO3 — CO2 + G1CI2 + H2O, how many giams of 
C CO3 are required to produce two moles of CC2? 

Point: for grading 

• Answer: 100 g (3 points) 

How many grams of O2 are needed to react with an excess of C4H10 ^0 produce 3 
moles of H2O? 

Points for grading 

• Answer: 208 g (3 points) 
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ENGLISH 

TruC'False Item 

-Z- 1. The setting for Shakespeare's Romeo and Juliet is 16th century England. 

JL 2. "Star-crossed lovei:s" means it was Romeo andjuliet's fate to die. 

JL 3. Benvolio is the peacemaker in the play. 

-1- 4. Romeo received a formal invitation to the masked feast. 

-Z- 5. Paris was at the tomb to give Juliet her wedding ring. 

-1- 6. Romeo was sentenced to death because this is what the prince had 
decreed. 

JH 7. A tragedy is a literary work in which the centnu character meets a disas- 
trous end. 

-1- 8. "Is now the two hours' traffic of our stage ..." is an example of a pun. 
/ iUltiple-Choice Itjm 

1. Tybalt's personality trait is described by Mercutio as being like a/an 

A. dog. 
♦B- cat. 

C. elephant. 

D. horse. 

2. In the Shakespearean plot structure, background information is revealed in die 

*A. exposition. 
5. complication. 

C. climax. 

D. resolution. 

3. A long uninterrupted speech while other characters rjre on stage is a/an 

A. aside. 

B. soliloquy. 

C. couplet. 
*D. monologue. 

4. According to Shakespearean plot structure, the climax cf this play occurs when 

♦A. Tybalt dies. 

B. Romeo dies. 

C. Juliet dies. 

D. Romeo and Juliet marry. 

? Why were the Monugues and Capulets feuding? 

A. Lord Montague once loved Lacty Gipuiet. 

B. Romeo loved Juliet. 

C. Lord Montague cheated Lord Capulet in business. 
*D. Shakespeare does not explain. 

6. Before taking the potion, /hat w;is Juliet's fear? 

A. It was a poison. 

B. She would regain consciousness alone in the tomb. 

C. She would see and hear her dead kinsmen. 
♦D. All of the above. 
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7. **Fof never was a story of more woe / Than this of Juliet for her Romeo " This 
is an example of a 
A. blank verse. 
*B. rhyme couplet. 

C. pun. 

D. prologue. 



Matching Item 







I 




n 


E 


1. 


Juliet's gift of love to Romeo 


A. 


gold statue 


H 


2. 


Rumco's tea! antagonist 


B. 


scenery 


A 


3. 


Lord Montague's gift to Juliet's memory 


C. 


wedding flowers 


C 
B 


4. 


Paris's gift at the tomb 


D. 


rope ladder 


5. 


backgrounds oa a stage that are not easily 


E. 


ring 






moved by hand 


D 


6. 


Nurse brings to Romeo after the marriage 


F. 


Mercutio 


F 


7. 


Romeo's best friend 


G. 


letter 


I 


8. 


movable item on siage 


H. 
I. 

J- 


fiiite 

prop 

Tybalt 



Essof Item 

Write a dialogue between the ghosts of Romeo and Juliet. Have them discuss 
their feelings toward Friar Laurence, Prince Escalus, and the Nurse. Also have them 
describe how they wish to be remembered. 

Points for grading 

^ Dialogue granunar, punctuation, and format (3 points). 

• Attitudes toward Friar Laurence, Prince Escalus, and the Nurse (30 
points). 

• How they wish to be remembered (5 points). 

Write a news obituary for Romeo or Juliet. Explain the details of their death, de- 
scribe two of their best personality traits, and conclude with a list of 10 of their sur- 
vivors or those who preceded them in death and how they are related. (Remember 
they were married, so the relationships arc more complex at the end of the play 
than at the beginning.) 

Points for grading 

• Obituar- grammar, punctuation, and format (5 points). 

• Details of death (5 points). 

• Two of their best personality traits (10 points). 

• Ten survivors or those preceding them in death and how they are related 
(20 points). 



Sample test items were prepared by Francie Blaney, former English teacher, Calumet High School, 
Gary, Indiana, and consultant for the Indiana Writing Project. 
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FRENCH 



True-False Items 

JL 1. Void un livrc. 

(Here is a book.) 

JL 2. Voici unc maison. 
(Here is a house.) 



Read the following French passage and answer the questions. 

Les parents de Pierre ne se sont pas levtfs aujourd'hui parce qu'ils se sent 
couches trSs tard hier soir aprds leur surprise-party. Quelle chance pour eux 
parce que c'est samedi et M. Dubois n'a pas besoin d'aller au bureau. 

(Pierre's parents did not get up early today because they went to bed very late 
lar night after their party. What luck for them because it is Saturday and Mr, 
Dubois does not need to go to the office.) 



Lcs patents dc Pierre sont encore couchtfs au lit. (Pierre's parents arc still 
in bed.) 

Us doivent tiavailler aujourd'hui. (They have to work today.) 

C'est samedi ct il y a une surprise party cc soir. (It is Saturday and there 
is a party tonight.) 

Multiple-Choice Items 



Read the following French passage and answer the qu&Uions. 

Madame Dupcnt est i la maison. Ce soir elle a d^id^ de preparer de la soupe 
i I'oignon pour le dmer. Ce matin elle a achettf des oignons au marchtf et main- 
tenant elle va prepare*- cette soupe ddicieuse pour sa femiUe, Toute sa famille 
aime la soupe i Toignon sauf son fils Jean-Luc. Sans doute il va manger des 
frites et un hot dog au lieu de cette soupe. 

(Mrs. Dupont is at home. She has decided to make onion soup for diimer this 
evening. She bought some onions at the market this morning and is now going 
to prepare this delicious soup for her family. The whole family likes onion 
soup except her son Jean-Luc. No doubt he will eat Froich fries and a hot dog 
instead of the soup.) 



1. OiJ est Madame Dupont? (Where is Mrs. Dupont?) 

A. Elle est au marchtf. (She is at^c market.) 

B. Elle est i table. (She is at the tabic.) 
*C. Elle est chcz elle. (She is at home.) 

D. Elle est dans la cuisine. (She is in die kitchen.) 




-i- 3. 

_L 4. 
^ 5. 
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2. Qu cst-cc qu'cUc va preparer ce soir? (What is she going to prepare tonight?) 

'^A. De la soupe i Toignon. (Onion soup) 

B. Des frites. (French fries) 

C. Des hots dogs. (Hot dogs) 

D. Un hot dog ddicieux. (A delicious hot dog) 

3. Est-ce que toute la famille aime cette soupe? (Does everyone like this soup?) 

A. Oui, tout le monde Taime. (Yes, everyone likes it.) 

B. Oui, mais pas son mari. Tes, but not her husband.) 
*C. Non, pas son fils. (No, not her son.) 

D. Non, sa fUle ne Taime pas beaucoup. (No, her -'lughter does not like it 
much.) 



Ess^ Items 

Describe in French a typical family meal at your house. 
Points for grading 

• Proper spelling and use of food and drink items (5 points). 

• Good composition (3 points). 

Desaibe in French an incident in your life when you were young. Be sure to pay 
special attention to the use of the imperfect and the past definite tenses. Include 
how the incident happened, where, why, and the consequences. This essay should 
be at least one page in length. 

Points for grading 

• The incident includes proper imperfea and past definite tenses (10 
points). 

• The incident identifies the how, where, why, and the consequences (10 
points). 

• The essay is at least one page in length (3 points). 
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SaiAple test items were prepared by Jsunes E. Brcker, Modem language Department, Malcolm Price 
Laboratory Scliool. University of NorUiem Iowa, Cedar Falls. 
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HOME ECONOMICS 



True-Falseltem 

•X. 1. Cottage cheese, yogurt, and ice cream are examples of choices from the 
milk group of the basic four food groups. 

JL. 2. Peanut butter, whole wheat bread, and ground beef are included in the 
meat group. 

-X- 3. One medium whole fruit or one-half cup of cooked vegetables equak one 
serving from the fruit/ vegetable group. 

-H 4. Citrus fruits include oranges, grapefruit, and tangerines. 

_X- 5. Four or more daily servings of foods from the bread and cereal group are 
needed. 



Multiple-Choice Items 

1. Which is a synthetic fiber? 

A. cotton 

B. linen 

C. alpaca 
*D. polyester 

2. Tabricsthat use tlie twill weave are 

*A. denim and gabardine. 

B. gabardine and nylon. 

C. gingham and nylon. 

D. nylon and linen. 

3. Which is a type of sleeve? 

A. mandarin 

B. Peter Pan 
♦C. raglan 

D. shawl 

4. Lines used in clothing design to create a feeling of height and slimness are 

A. diagonal. 

B. horizontal. 
*C. vertical. 

D. curvy. 

3. A nataral fiber is 

A. acrylic. 

B. rayon. 

C. triacetate. 
♦D. wool. 
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Matckag Items 

I 



II 



j£. 1. processing method for low acid foods 

2. processing method for high acid foods 

^SL. 3* meats and vegetables 

JEL 4. canning method recommended only for 



A. blanching 

B. boiling water bath 



C. high acid foods 

D. hot pack 



jellies 

5. fruits, pickle;^, tomatoes 



E. low acid foods 

F. open kettle method 



G. pressure cooker 

H. raw or cold pack 



Ess^ Items 

Explain the procedure for preparing a basic white sauce. Include ingredients and 
three or more uses. 

Points for grading 

• Procedure is clear and understandable (8 points). 

• Ingredients are correctly stated (6 points). 

• Three or more uses are given (6 points). 

Prepare a personal monthly budget, providing specific examples of fixed and 
flexible expenses 

Poif^ts for grading 

• Budget is clear, accurate, and understandable (10 points). 

• Specific examples are provided for fixed and flexible expenses (10 points). 



Sample test items were prepaied by Judith M. Tudcer, Bettsville Local Sdiools, Ohio. 
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INDUSTRIAL EDUCATION 



True-False Items 

-X. 1. A #4 wood screw is smaller in diameter than a #10 wood screw. 

JL 2. An abrasive paper grit size 120 is coarser than an abrasive paper grit size 
80. 

-E. 3. Plywood, particle board, and hardboard are purchased by board feet. 
JL 4. An 8d nail is 2%'' long. 

3. An auger bit stamoed 12 on the tang would bore a hole 7/8". 

Multiple-Choice Items 

1. A process of hammering and squeezing metal (usually heated) into a required 
shape is 

A. casting. 

B. extruding. 
*C. forging. 

D. machining. 

2. A metal casting process that produces a very fine finish is 

A. centrifugal. 

B. full mold. 

C. green sand. 
*D. investment. 

3. The slot that is made in wood when sawing is the 

A. groove. 

*B. kerf. 

C. kreft. 

D. rake. 

4. The two methods of drying lumb . are air drying and 

A. age drying. 

B. chemical drying. 

C. evaporation drying. 
*D. kiln drying. 

3. The solvent for shellac is 
*A. alcohol. 

B. lacquer thinner. 

C. turpentine. 

D. water. 



ERLC 



75 ^ 



Matching Items 

I 



II 
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A. cove 

B. lock dado 

F 2. I \ ^- ^^^^^ 

D. nosing 

E. ogee 

F. quarter-round 



JL 4. 



Essay Item 

Describe the difference between continuous and intermittent manufacturing. 
Provide three examples of products produced by each of these manufacturing 
processes. 

Points for grading 

• Key words (continuous and intermittent) are defined (19 points). 

• Three product examples for continuous manufacturing are given (10 
I Mnts). 

• Til. product examples for intermittent manu&cturing are given (10 
points). 



Sample test items wen prepared by Duane H. Rippe, Holmes Junior High School, Cedai Falb, Iowa. 
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MATHEMATICS 



True-False Item 



F 


1. 


For every positive oumber x, ^'is less than r 


T 


2. 


The expression 3 - 2 - 2 (3x - 7) is equal to -3x 4 12. 


T 


3. 


The product (3a + 5)(a - 1) equals 3a* + 2a - 5. 


F 


4. 


If triangle ABC is isosceles and the measure of angle A is 10 J", then B is 
100'. 


F 


5. 


Seven is a factor of 17. 



Multiple-Choice Items 

1. The diameter of one circle is 6 cm ar d the diameter of another circle is 12 cm. 
How many cm larger is the area of one circle than the other? 

A. 37rcm2 

B. 671" cm* 
*C. 277rcm2 

D. lOSTTcm* 

2 . A heptagon has how many sides? 

A. 5 

B. 6 
*C. 7 

D. 8 

3. What is the expanded product of (2x - 4) and (x + 1)? 

A. 2x^-4 

B. 2x* + 4 

C. 2x* + 2x + 4 
*D. 2x*-2x-4 

4. If9 — a = 0.5a, what doesa equal? 

A. - 4.5 
*B. 6.0 

C. 18 

D. -18 

5. If 2x - 5 = 7x - 15, what does x equal? 

A. 4 
*B. 2 

C. -4 

D. -2 

6. In a basketball game , Jessica made 7 5 percent of her 1 2 free throw attempts. 
Leanne made 60 percent of her TC free throw attempts. How many more free 
Jirows did Jessica make than Leanne? 

A. 1 

B. 2 
*C. 3 

D. 4 
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7. A park in the shape of a rectangle is 325 meters long and 100 meters wide. 
What is the measure of the perimeter of the park? 
♦A. S30 meters 

B. 750 meters 

C. 525 meters 

D. 425 meters 



Matching Items 







I 




II 


F 


1. 


3/4 


A. 


0.060 


C 


2. 


3/5 


B. 


0.500 


D 


3. 


5/8 


C. 


0.600 


A 


4. 


3/50 


D. 


0.625 


B 


5. 


13/26 


E. 


0.670 








F. 


0.750 








G. 


0.875 



Sample test items were prepared by John Tarr, Malcolm Price Laboratory School, University of Northern 
Iowa, Cedar Falls. 
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MUSIC 

True-False Items 



1 








T p p p p r p r ^ r P r' ' 


1 



-iL 1. The most liJccly meter for the example above would be 2/4. 

JL- 2. A performer will adjust the written dynamic markings to fit various per- 
formance conditions. 

-1- 3. Accurate tuning of one's instrument to concert B-flat assures that all oth- 
er notes will be played in tune. 

JL 4. A key signature containing three flats indicates that the music to follow 
must be in the key of E-flat. 

Multiple-Choice Items 

1. Which music direction is least likely to appear in the famous march "The 
Thunderer'' (Sousa)? 

A. tacet IX 

B. simile 

C. marcato 
'^'D. con sordino 

2. Which of the following sets shows tempo markings in a correct slow-to-fast 
order? 

A. andante-adagio-vivace 

B. allegro-andante-presto 
*C. largo-moderato-allegro 

D. andante-largo-presto 



with mutB 











jg » H J— 1 J 








vlth mt» 
IM ^ f 1 


,J J 
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3. The preceding example appears to be scored for which combination of instru- 
ments (in score order)? 
A. trumpet-horn- trombone 
*B. violin-darinet-cello 

C. flute-clarinet-string bass 

D. violin-viola-cello 

Matching Items 

You will hear rive short excerpts (about 4j seconds each) of unfamiliar cOiicert 
band pieces. Use the clues provided with each title in cclumn I to match the pieces 
with die letters in column n. 

I II 

1. Ecossaise (Beethoven): homophonic texture, strong A, 1st excerpt 

accents, repetition of short sections, detached style. 

B. 2d excerpt 

2. ''Siciliana" from A Litt/e Concert Suite (Reed): 

compound meter, legato style, melody predomi- 3d excerpt 
nantly set in woodwinds, homophonic texture. excerpt 

3. Lyric Essay ^v:oakley): legato style, polyphonic tex- p exremt 

ture, duple meter, brief baritone solo. cxccrpi 

4. "A Toyr" from Giles Famaby Suite (arr. Wiggins): 

alternating homophonic and polyphonic textures, 
duple meter, minor mode, mixture of detached and 
legato styles. 

3. Scottish Rhapsody (Grundman): colorful effects, use 

of folk song material, meter changes, unusual voic- 
ings. 

Esscyltem 

Most successfu' pieces of music show a balance between contrast and repetition. 
Why? In "Etude No. 3," how docs the composer create repetition and contrast? 

Points for grading 

Structure (13 points) 

• Perception of organization in music seems necessary for enjoyment (3 
points). 

• Repetition and contrast are essential elements of musical organization and 
structure (3 points). 

• Repetition and contrast may help define orgarazation and structure at sev- 
eral levels (3 points). 

• The elements of music used to create repetition and contrast may include 
rhythm, melody, harmony, tone color, text, and form (1 point for each 
element — 4 points total). 

Mixture of novel and known (10 points) 

• Humans seem to find stimuli that are a mixture of novel and known in- 
formation to be the most interesting (4 points). 

• Too much novel information in a musical work is often perceived as chaot- 
ic (3 points); too much repetition of known information is perceived as 
boring, dry, or academic (3 points). 
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PHYSICAL EDUCATION 



True-False Items 



F 


1. 


In shuffleboard, the fust player to score 21 points wins. 


T 


2. 


Each player in shuffleboard has four discs. 


T 


3. 


It is possible to score 10 points off in shuffleboard. 


F 


4. 


Shuffleboard must be played by four players. 


F 


5. 


In shuffleboard, there are five different areas in which to score points. 



Multiple-Choice Items 

1. In golf, a score of three on a 125-yard hole is a/an 

*A. par. 

B. bogey. 

C. birdie. 

D. eagle. 

2. The most common grip used by professional golfers is the 

A. overlapping. 
'^'B. interlocking. 

C. baseball. 

D. overlocking. 

3. In golf, which is a two-stroke penalty? 

A. hitting into the woods 

B. teeing off out of order 

C. throwing a club 

'^'D. grounding a club in a hazard 

4. In golf, an apron refers to the 

A. putting green. 

'^'B. grass located adjacent to the green. 

C. sand trap beside a green. 

D. towel used to wipe off dirty clubs. 

5. Which club would a golfer use if the ball were 50 yards from the hole? 

A. driver 

B. brassie 
*C. wedge 

D. three iron 
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Matching Items 







I 




II 


F 


1 

1. 


love 


A 

A. 


Sucncry 


D 




CAglC 


n 


DSUlIIllRiOn 


H 


a 

5» 




r 


SnUXIIC DUaid 


B 


•4. 


snuiucwvK 




gou 


c 




disc 


C. 


Dowiing 


VJ 


6. 


paddle 


F. 


teams 








G. 


table tennis 








H. 


hotseshoes 








I. 


baseball 



Essofltem 

Describe five gyn: nastic eveni^ and draw a picture of the apparatus used in each 
event. Identify a specific stunt that is performed on each apparatus. 

Points for grading 

<^ Five gymnastic events (10 points). 

• Drawing of apparatus for five gymnastic events (10 points). 

• Specific stunt tor each event (10 points). 



Sample test items weie pftpaied by Thomas L. TUcker, Lakota Schools, Rt<'> g Sun, Ohio. 
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PHYSICS 

True-False hem 

JH 1 . Fifty-six miles is a greater distance than 80 kilometers. 

JL 2. The slope of a velocity-tim^ curve (graph) defines acceleration. 

JL. 3. Constant speed and constant acceleration are the same motion. 

JL. 4. Mass is a vector quantity. 

3. Mass and weight arc measured in the same units. 

JL 6. The **order of magnitude" of the number 5.3 X 10* is four. 

-E- 7. The ^ngonometric funcdon sine is equal to the adjacent side divided by 
the hyp-)teuuse. 

JH 8. The SI unit for velocity is meters/second. 

.JL 9. Sliding friction is less than static friction. 

-21 10. It is possible for a vector of 5 units in length to be added to a vector of 6 
units in length and end up with a vector of 8 units in length. 

Multiple-Choice Items 

1. In mechanics, which three fundamental units are used to measure quantities? 

A. length, mass, and speed 

B. mass, speed, and time 
*C. mass, length, and time 

D. lengdi, speed, and timt 

2. A weight attached to a spring is vibrating up and down in simple harmonic mo- 
tion. At which pla'^e is the weight undergoing minimum acceleration? 

A. top of the vibration 
*B. middle of the vibration 

C. bottoia of the vibration 

D. same at all points 

3. How do gravitational forces compare with electrical forces? 

*A. much smaller 
b. smaller 

C. larger 

D. much larger 

4. In the /mpulse-momentum equation ft == mv, which variable is most important 
in the coach's request to "follow through"? 

A. force (f) 
♦B. time(t) 

C. mass (m) 

D. velocity (v) 

3. Which of the following is a scalar qu-^ntity? 

A. accel ation 

B. force 

C. velocity 
♦D. work 

83 8-^ 



6. The value of the gnivitational constant G was mensured about 1800 by 

A. Newton. 

B. Kepler. 
♦C. Cavendish. 

D. Brahe. 

7. Which of the following is a unit of energy? 

A. watt 

B. newton-second 

C. joule /second 
*D. joule 

8. Waves provide a means of transferring 

A. matter. 

B. liquids. 

C. particles. 
♦D. energy. 

9. Sound waves caimot travel through 

A. water. 

B. air. 

*C. vacuum. 

D. steel. 



Matching It€7ns 
I 

-1« 1. Y = 5X 



11 



A. y 



f 



_L 2. Y = 7X2 



JL 3. Y = 4 



-iL 4. Y = 3 VX 



_C_ 5. XY = 6 



B. 



D. 



E. 




1^. 



F. 
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Essay Items 



A mcrry-go-found is 42 feet in diameter. It is making 5 complete revolutions in 
two minutes, A small child slips from a horse and falls off the edge. At what speed 
does the child hit the ground? 

Points for Grading 

» G)rrect procedure (5 points). 

• Answer: 5.5 feet/secotid or 3.4 mph (5 points). 

The famous cliff divers at Acapulco, Mexico, dive from a cliff 60 meters high. 
Rocks extend outward at the bcittom of the cliff for about 20 meters. What is the 
minimum horizontal velocity the divers must have in ordei to clear the rocks 
below? 

Points far Grading 

• Correa procedure (5 points). 

• Answer: 5.7 meters/ second (5 points). 
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SCIENCE 



T?iie-False Items 

-2L 1. The sun is the center of the solar system. 
-L- 2. A light year measures time. 

-Z- 3. On a clear night, Polaris can be seen directly overhead. 

-Z- 4. The main fuel of a star is oxygen. 

.2L 3. Jupiter is the largest planet in the solar system. 



MulHpk'CL .^ice Item 

1. The sun is a/an 

"^A. middle-aged star. 

B. old star. 

C. young star. 

D. dead star. 

2. A star's energy is produced by 

A. burning. 
*B. fusion. 

C. reflection. 

D. chemical reaction. 

3. The moon revolves arour.d the earth in 

A. 24 hours. 

B. 7 days. 
♦C. 2773 days. 

D. 365 days. 

4. Meteoroids that readi the earth's surface are 

A. asteroids. 

B. comets. 

C. meteors. 
*D. meteorites. 

3. Planets are observed because they 

A. bum. 

B. give o£f their own light. 
*C. reflect the sun's lighc. 

D. reflea the earth's light. 
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Matching Items 







I 


II 


D 


1. 


a telescope that uses no light 


A. parallax 


B 


2 




B. reflecting telescope 


A 


3. 


a procedure for mea^suring star distance 


C. refracting telescope 


c 


4. 


uses a convex lens 


D. radio telescope 


H 


5. 


separates light into colors 


E. chemical reaction 



P. reflectinr inicrf pc 

G. chemical fusion 

H. spectroscope 



Essay Item 

How do astronomers know if galaxies are moving closer to or away from the 
earth? 

Points for grading 

• Doppler effect (1 point). 

• Blue-shift in spectrum— toward (1 pcint). 

• Red-shift in spectmro— away (1 point). 
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SOCIAL STUDIES 



True-False Items 

JL 1. Gcogiaphcfs have accepted the theory of * 'Continental Drift" as an ex- 
planation of past and present configurations of continents. 

2. The climate of land next to large bodies of water will change less rapidly 
than the climate farther inland. 

F 3. Humid tropical climates are found within five degrees latitude north and 
south of the Tropics of Cancer and Capricorn. 



Use the following graph to answer statements 4 and 5. 



Income Level by Education, Race, and Origin of Household 

Income 
(1,000) 

-50 

n -45 

-40 




ELEMENTARY HIGH SCHOOL COLLEGE 



White Q Hispanic @ Black I 

SOURCE: U.S. Departnient of Co'nnierce, Bureau of the Census, Statistical Abstracts of the United States 
1988 (Washington: D.C.: Government Printing Office), p. 429 (adaptation). 



4. Black, white, and Hispanic people with one to three years of college 
make over $33,000 a year. 

JL 5. Black, whirc, and Hispanic people with eight years of schooling make less 
money than persons with four years of high school* 
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Multiple-Choice Items 



1 Which of the following statements is fact rather than opinion? 
A. Federal aid to local communities increases the national debt. 
♦B. Since the 1950s, the percentage of persons over age 65 has been increas- 
ing in the United States. 

C. America's involvement in the Vietnam War ircicased young people's dis- 
trust of government. 

D. Voting for Republicans is a vote to cut federal spending. 



Read the fouowing statement and respond to item 2. 

Sue: I am so tired of reading about ciiminals who are freed by the courts 
because the police did not interrogate them properly. Courts should not 
free guilty people because of technicalities. Suspects should be required to 
answer all the questions the police ask when they are apprehended. 

Bill: I am not so sure I agree with you. After all, those "technicalities'* 
during interrogation are designed to prevenc police fkom obtaining forced 
confessions ijom people. Without rules governing inteirogation proce- 
dures, what is to prevent the police fron. just grabbing anybody and forc- 
ing the person to confess to a crime? 



2. Sue and Bill's dialogue indicates that they have a value conflict over 
A. individual guilt versus legal technicalities. 
*B. police power versus protection of the innocent. 

C. court procedpres versus public Si' ^. 

D. individual rights versus the right • ' polic** 



Essay Item 

Provide a solution to the current acid rain problem in northeastern United States 
and southeastern Canada. Explaii ^hy your solution is more efifectiv than previ- 
ously proposed approaches. 

Points for grading 

• United States and Canadian government proposals are described (20 
points). 

• Industry proposals are mentioned (10 points). 

• Student's proposal indicates a reasonable understanding of the problem 
with possible solution (20 points.) 



Sample test items were pre;) ured by Stephen Rose. Malcolm Price Laboratory School. University of 
Northern Iowa, Cedar Falls. 
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SPAIOSH 



Tme-Falseltem 

.X. 1. Cicnto is shonened to cicn before all nouns and before mil and milloncs . 

JL 2. IM is omined before ciento but is used before mil and miUcn. 

-3L 3. The word y is used between tens and units in numbers. 

-X. 4. If the direct object of a verb is a noun that refers to a definite person, the 
word a is used in front of the direct objea. 

^1. 3. When salir is followed by an object, Spanish uses the preposition a in 
fiont ofthc objea. 

JL 6. If the action in the dependent clause xAxs place in the past, the past 
subjunctive may be used in a dependent clause that follows a main clause 
verb of present, future, or present perfect tense. 

JL 7. The future, the conditional, or the present subjunctive is not used after si 
meaning '*if." 



Multiple-Choice Items 

1. If he is in his room, he's studying, 

A. Si estuviera en su cuarto, estudiaria. 

B. Si es en su cuarto, estudia. 
*C. Si estsC en su cuarto, estudia. 

D. Si hubiera estado en su cuano, est estudiando. 

2. If he were here, he would tell us. 

*A. Si estuviera aquf, nos dirfa. 

B. Si fuera aqui, nos daria. 

C. Si estsf aquf, nos dice. 

D. Si hubiera sido aqui, nos dirfa. 

3. If he had traveled more, he would have obtained the job. 

A. Si vi?.ja mis, obtendnf el pursto. 

*B. Si hubiera viajado mas, habrui obtenido el puesto. 

C. Si habfa viajado mis, hubiera obtenido el puesto. 

D. Siviajaramas, habrt obtenido el puesto. 

4. Ifl were you, I wouldn't say that. 

*A. Si fuera usted, no dirfa eso. 

B. Si estuviera usted, no dirfaeso. 

C. Siest^usted, nodirfaeso. 

D. Si fuera usted, no darfa eso. 
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Matching Items 





1 




n 


I 


t. patron saint of Mexico 


A. 


Taxco 


F 


2. Aztec capital city 


B. 


Guadalajara 


A 


3. femous silver mining city 


C. 


Chapultepec 


C 


4 t^^lace 2.nd n^rk in M^Yirn C*\tv 


D. 


Xochimilco 


J_ 


5. leader of the Mexican Revolution in 1810 


E. 


Teotihuacan 


E 


6. ancient ruins north of Mexico Citv 


F. 


Tenochtitlah 


H 


/. leatnerea serpent goci 


G. 


Chichen Ixzi 






H. 


Quetzelcoatl 






I. 


Guadalupe 






J. 


Hidalgo 



Essay Item 

Write \ dialogue in Spanish for th^ following situation: 

You wish to rent a room in a hotel and tell the dfsic clerk you wan*^ a room with 
two beds. You want three bars of soap, two towels, and comfortable beds, TTie 
room should face the street and have a good view. You ask what time meJs are 
served and tell the clerk you will stay for a week. 

Points for grading 

• Dialogue includes all items specified (8 points). 

• Present tense verbs (1 point). 

• Expression dar a (1 point). 

• Use of indefinite subject or leflexive substitute for passive voice when ask- 
ing about n;eals (1 point). 

• Use of future tense when indicating length of stav (1 point). 



Sample test items were prepared by Caiol Unc^rwood, Betmlle Local Schools, Ohio. 
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GLOSSARY OF TERMS 



Affective objectives emphasize leelings* emotions, actitudes, and values. 
In a bimodal distribution two scores tie for most frequent occurrence. 
Cognitive objectives emphasize intellectual tasks. 

Concurrent validity compares a teacher-made test with a similar assessment measure. 

Construct validity identifies the psychological traits or underlying constructs of a test. 

Content validity is the most common type of validation used by teachen t« ascertain if the 
test provides an accurate assessment of the instructional objectives. 

Equivalent forms is a method of establishing reliability by correlating scores from two dif- 
ferent» but equivalent, forms of a test. 

A frequency distribution is a tally of scores for a particular test arranged from highest to 
lowest. 

Item difficulty indicates the proportion of studenis who respond'^d correctly to a test item. 

Item dtscriminaHon provides an index of how an item discriminates between high- and low- 
scoring students. 

The Kuder-Richcrdson Formula 21 (KR21) is a method of esublishing reliability that re- 
quires a single administration of a test and uses the mean and standard deviation of the test 
scores. 

The mean is the arithmetic average of test scores. 

The median is aie middio test score in a list of scores from highest to lowest. 

The mode is the n^ost frequent test score (the score made by the largest number of stu- 
dents) in a frequency distribution. 

Negative discrimination occurs when more low- than high-scoring students respond correctly 
to a test item. 

A negatively skewed distribution occurs when the longer tail of the distribution extends to- 
ward the negative (left) end of the distribution. 

No discrin ination occurs when an equal number of high- and low-scoring students respond 
correctly to a test item. 

A normal curve repre.« .^^s a normal or symmeuical distribution of scores. 

An objective is a communicsition device that specifies the knowledge, skills, and attitudes 
expected ox students at the end of an instruvucnal unit. 

A percentut rank indicates the pettentage of students in a class who obtained raw scores be- 
low a given score. 

Positive discrimination occurs when more high- than low-scoring students respond correct'^ 
to a test item. 

A positively ji^u^y distribution occurs when t . / longer tail of the distribution extends to- 
ward the positive (right) end of the distribution. 

Prc'dicHve validity correlates test performar^ce with a future outcome. 

Psychomotor objectives emphasize skill development and require neuromuscular 
coordination. 

The range is the difference between the highest and lowest scores on a test. 
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RanA indicates the position of a score in relation to other scores. 
A fow score is the actual score that a student receives on a test. 

Reliability provides an estimate of the consistency of test results; it is expressed as a correla- 
tion coefficient reported on a scale ranging from 0,00 to LOO. 

A response profile provides an index of the frequency of student resp^^nses to test 
alternatives. 

A skewed distribution is an asynmietrical distribution with a majority of scores at one end 
of the distnbution. 

The Spearman-Brown Formula is a conection formula applied to die correlation coefficient, 
which establishes a reliability coefficient for the entire test. 

Split-half ]s a method of establishing reliability by correlating the scores from two halves of 
the same test. 

Standard deviation is a measure of dispersion that indicates how scores within a distribution 
deviate from the mean. 

A T'score represents the number of standard deviations a raw score is from the mean in a 
distribution. Unlike z-sceres, T-scores do not use decimals or negative numbers. 

Test-retest is a method of establishing reliability by correlating scores from two administra- 
tions of the same test to the same group of students in a given time interval. 

In a trimodal distribution three scores tie for most frequent occurrence. 

Validity is the extent to which a test measures what it was intended to measure. 

A Z'score represents the number of standard deviations a raw score is from the mean in a 
distribution. 



94 



BffiUOGRAPHY 



Adkins, D. C. Test Construction: Development and Interpretation of Achievement Tests, 
2d ed. Columbus, Ohio: Charles E. MerrUl, 1974. 

American Psychological Association. Standards far Educational and Psychological Testing. 
Washington, D.C.: American Psychological Association, 1986. 

Archbald, D. A., anu Newmann, F. M. Beyond Standardized Testing: Assessing Authentic 
Academic Achievement in the Secondary School. Reston, Va.: National Association of Sec- 
ondary School Principals, 1988. 

Ball, D. W. **Level of Teacher Objectives and Their Classroom Tests: Match or Mismatch." 
Journal of Social Studies Research 10, no. 2 (1986): 27-31. 

Bergman, J. Understanding Educational Measurement and Evaluation. Boston: Houghton 
Mifflin, 1981. 

Bloom, B. S.; Englehart, M. D.; Furst, E. J.; Hill, W. H.; and Krathwohl, D. R. Taxonomy 
of Educational Objectives: Handbook I Cognitive Domain. New York: David McKay, 
1956. 

Brown, F. G. Measuring Classroom Achievement. New York: Holt, RLiehart and Winston, 
1981. 

Carlson, S.B. "Creative Testing Ideas." Social Science Record 24, no. 1 (1987): 6-7. 

Carter, K. "Tcst-Wiseness for Teachers and Students." Educational Measurement: Issues 
and Practice 5, no. 4 (1986): 20-23. 

Cashin, W. E. Improving Essay Tests. Idea Paper No. 17. Manhattan, Kan.: Center for Fac- 
ulty Evaluation and Development, Division of Continuing Education, Kansas State Univer- 
sity, 1987. 

Clegg, V.L., and Cashin, W.E. Improving Multiple-Choice Tests. Idea Paper No. 16. 
Manhattan, Kan.: Center for Faculty Evaluation and Development, Division of Continuing 
Education, Kansas State University, 1986. 

Coker, D.R.; Kolstad, R. K.; and Sosa, A. H. "Improving Essay Tests: Structuring the 
Items and Scoring Responses." Clearing House 61, no. 6 (1988): 253-55. 

Denova, C.C. Test Construction far Training Evaluation. New York: Van Nostrand Rein- 
hold, 1979. 

Ebel, R. L., and Frisbie, D. A. Essentials of Educational Measurement. 4th ed. Englewood 
Cliffs, N.J.: Prentice-Hall, 1986. 

Erickson, R. C, and Wentling, T.L. Measuring Student Growth: Techniques a>:d Proce- 
dures far Occupational Education. Urbana, 111.: Griffon Press, 1988. 

Green, K.E., and Stager, S.F. "Differences in Teacher Test and Item Use with Subject, 
Grade Level Taught, and Measurement Coursework." Teacher Education and Practice 4, 
no. 1 (1987): 55-61. 

Gronlund, N.E. How to Construct Achievement Tests. 4th ed. Englewood Cliffs, N.J.: 
Prentice-Hall, 1988. 

Measurement and Evaluation in Teaching. 5th ed. New York: Macmillan, 1985. 

Stating Objectives for Classroom Instruction^ 3d ed. New York: Macmillan, 1985. 

GuUiksen, H. ^ reating Better Classroom Tests. Prirreton, N.J.: Educational Testing Ser- 
vice. 1985. 



ERLC 



94 



or; 



Ingram, C.F. Fundamentals of Educational Assessment. New York: Van Nostrand Rcin- 
hold, 1980. 

Kar*^, 1. * 'Realistic Approach to Thinking Skills: Reform Multiple-Choice Questions." 
Social Science Record 22, no. 2 (1985): 38-43. 

Kelleghan, G. F.; Madaus, G. P., and Airasian, P. W. The Effects of Standardized Testing, 
Norwell, Mass.: Kluwer Academic, 1981. 

Kirby, P. C, and Oescher, J. 'Testing for Critical Thinking: Improving Test Development 
and Evaluation Skills of Classroom Teachers." Paper presented at the annual meeting of 
the Mid-South Educational Research Association, Mobile, Alabama, November 1987. 

Kline, P. A. Handbook of Test Construction: Introduction to Psychometric Design, New 
York: Methuen, 1986. 

Kolstad, R. K., and Briggs, L. P. ''The Application of Item Analysis to Classroom Achieve- 
ment Tests." Education 105, no. 1 (1984): 70-72. 

Krathwohl, D. R., Bloom, B. S.; and Masia, B. B. Taxonomy of Educational Objectives: 
Handbook Ih Affective Domain, New York: David McKay» 1964. 

Kryspin, W. J., and Feldhusen, J. F. Developing Classroom Tests: A Guide for Writing and 
Evaluating Test Items, Minneapolis: Burgess, 1974. 

Lang, R. M. Strategies for Improving Teacher-Made Tests, DeKalb* HI.: Northern Illinois 
University, 1983. ERIC Document Reproduction Serv e. ED 247 261. 

Lcuba, R.J. "Machine-Scored Testing, Part 1: Purposes, Principles, and Practices." Engi- 
neering Education 77. no. 2 (1986): 89-95. 

Linn, R. L. Educational Measurement, 3d ed. New York; Macmillan, 1989. 

Lodish, E. "Test Writing Made Simple: Generate Tests and Worksheets ElectronicaUy." 
Electronic Learning 5, no. 5 (1986): 28, 30, 68. 

Mager, R.F. Preparing Instructional Objectives, 2d ed. Belmont, Calif.: Pitman Manage- 
ment and Training, 1984. 

Marso, R. N. "Testing P actices and Test Item Preferences of Classroom Teachers." Paper 
presented at the annual meeting of the Midwestern Educational Research Association, Chi- 
cago, October 1985. 

Marso, R. N., and Pigge, F. L. "An Analysis of Teacher-Made Tests: Testing Practices, Cog- 
nitive Demands, and Item Construction Errors." Paper presented at the annual meeting of 
the National Council on Measurement in Education, New Orleans, April 1988. 

McAuthur, D. L. Alternative Approaches to the Assessment of Achievement, Norwell, 
Mass.: Kluwer Academic, 1987. 

Miller, P. W. "Review of LXR^TEST (Version 3.1) by Logic Extension Kcsourccs,'' Journal 
of Industrial Teacher Education 26, no. 2 (1989): 63-65. 

"Developing Good Tests in Technical Education. " Jo^nw/ of Vocational Educa- 
tion 63. no. 5 (1988): 51-52. 

/'The Effects of Selected Industrial Arts Activities on Educable Mentally Retarded 

Students' Achievement and Retention of Metric Linear Concepts." Doctoral dissertation. 
The Ohio State University, 1977. Dissertation Abstracts Internationalism 4627A. University 
Microfilms No. 7731934. 

Miller, P. W., and Erickson, H. E. Teacher-lVritten Student Tests: A Guide for Planning, 
Creating, Administering, and Assessing, Washington* D.C.: National Education Associa- 
tion, 1985. 

Miller, ^. R. Instructors and Their Jobs, Homewood, III.: American Technical Publishen, 
1990. 

Nimmer, D.N. "Multiple True-False Classroom Tests." Cleanng House 56, no. 6 (1983): 
257-58. 



ERLC 



95 



••Measures of Validity, Reliability, and Item Analysis for Classroom Tests." Clear- 
ing House 58, no. 3 (1984): 138-40. 

Nitko, A.J. Educational Tests and Measurements: An Introduction. New York: Harcourt 
Brace Jovanovich, 1983. 

O'Brien, N. P. Test Construction: A Bi&liograpAy of Selected Resources, New York: Green- 
wood Press, 1988. 

Popham, J. W. Modem Educational Measurement. Englewood Cliflfe, NJ.: Prenticc-Hall, 
1981. 

Rencau, F. W. "Framing the A+ Test.'' Journal of Vocational Education 64, no. 5 (1989): 
34-35. 

Sdiocr, L. A. Test Construction: A Programmed ide. Boston: Allyn and Bacon, 1970. 

Simpson, E. T. "The Qassification of Educational Objectives in the Psychomotor Domain." 
In The Psychomotor Domain. Vol. 3. Washington, D.C.: Gryphon House, 1972. 

Smith, C. W. "100 Ways to Improve and Use Teacher-Made Tests." Illinois Schools Jour- 
nal 66, no. 3 (1987): 20-26. 

Steinmetz, J. E.; Romano, A. G.; and Patterson, M. M. "Statistical Programs for the Apple 
II Microcomputer." Behavior Research Methods and Instrumentation 13. no. 5 (1981): 702. 

Stiggins, R.J., and Bridgeford, N.J. "The Ecology of Qassroom Assessment.'' Journal of 
Educational Measurement 22, no. 4 (1985): 271-86. 

Stiggins, R.J.; Conklin, N. P.; and Bridgeford, N.J. "Classroom Assessment: A Key to Ef- 
fective Education." Educational Measurement: Issues and Practice 5, no. 2 (1986): 5-17. 

Stiggins, R.J.; Rubel, E.; and Quelhnalz, E. Measuring Thinking Skills in the Classtvom. 
Washington, D.C.: National Education Association, 1988. 

Stith, J. H., and Constantine, A. G. "Writing Better Physics Exams." Physics Teacher 26, 
no. 3 (1988): 138-44. 

Swezey, R. W. Individual Performance Assessment: An Approach to Criterion-Referenced 
Test Development. Reston, Va.: Reston Publishing, 1981. 

Thomdike, R., and Hagen, E. Measurement andEvalt ^Hon in Psychology and Education. 
New York: Macmillan, 1986. 

Tchudi, S.N., and Yates, J. Teaching Writing in the Content Areas: Senior High School. 
Washington, D.C.: National Educauon Association, 1983. 

Wilson, L. R.; Scherbeth, B. C; Brickel, H. M.; Mayo, S. T.; and Paul, R. H. Determining 
Validity and Reliability of Locally Developed Assessments— 1988. Springfield, 111.: Illinois 
Sute Board of Educauon, 1988. ERIC Document Reproducrion Service. ED 300 417. 

Wolansky, W. D. "Evaluating Student Performance in Vocational Educauon." Educational 
Horizons 65, no. 1 (1988): 42-44. 



97 

96 



